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RESISTANCE ELEMENTS AND USES THEREFOR 

FIELD OF THE INVENTION 
[00011 THIS INVENTION relates generally to pathogenic resistance. More 
particularly^ ttie present invention relates to polynucleotide and polypeptide sequences 
5 involved in the resistance mechanism of plants to patbiogens» especially fungal 
pattiogens. The present invention also relates to the use of &ese sequences for 
modulating plant resistance and for producing genetically modified plants having 
modified pathogen resistance characteristics. 

[00021 Bibliographic details of certain publications referred to by author in this 
10 specification are collected at the end of the description. 

BACKGROUND OF THE INVENTION 
10003] Banana is one of the world's most inQKHtant finit oops with a world 
production of qpinroximately 98 million tonnes annually (FAD, 2001). However, as with 
many monocultures, banana is affected by a range of fimgal^ viral, bacterial and 
15 nematode diseases, which cause sev^e economical losses every year. 

[0004] Fusarium wilt is one of the most destructive and notorious diseases of 
banana. It is also known as Panama disease, in recognition of the extensive damage it 
caused in export plantations in this Central American country. By 1960, Fusarium wilt 
had destroyed an estimated 40,000 ha of *Gros Michel* (AAA), causing the export 

20 industry to convert to cultivars in the Cavendish subgroup (AAA) (Ploetz and Pegg, 
2000). Fusarium wilt is caused by the soilbome hyphomycete, Fusarium oxysporum. 
Schlect. f. sp. cubense. It is one of more than 120 fomaae speciales (special forms) of 
axysporum that cause vascular wilts of flowering plants. This pathogen affects species 
of Musa and Heliconia^ and strains have been classified into four physiological races 

25 based on pattiogenicity to host cultivars in the field (race 1, *Gtos Michel'; race 2, 
*Bluggo6^; race 3, Heliconia spp.; and race 4, Cavradish cultivars and all cultivars 
susceptible to race 1 and 2). Until recently, race 4 had only been recorded to cause 
serious losses in the subtropical regions of Australia, South Afiica, the Canary Islands, 
w& Taiww. If this race were to become established in the Americas, tfie world export 

30 industries would be severely affected, as there is no widely accepted replacraient for 
Cavendish cultivars (Bentley et oL, 1998). 
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[0005] In general, eJEfeotive chemical control measures do not exist. In work 
conducted in South Afiica, meth>4 bromide significantly reduced disease incidence, but 
was effective for only three years due to recolonisation of the fumigated areas by the 
pathogen. Studies on the biological and culture control of this disease have begun only 
5 recently, Arbuscular mycorrtuzal fimgi have been shown to reduce disease severity in 
short-term green house studies, but results from long tenn field studies have not been 
reported (Ortiz et al,, 1995). Tissue-culture plantlets are free of pathogens and diould be 
used to establish new plantings whenever possible. However the expense of plantlets 
may make their use fai subsistence agriculture inqjractical. Genetic resistance offers the 
10 greatest opportunity for managing this disease in infested soils (Ortiz er a/., 1995). 

[0006] Plants recognise and resist many invading pathogens by inducing a rapid 
defence response, tamed the hypersensitive response (HR). The HR results in localised 
cell and tissue deatii at the site of infection, which constrains fiuther spread of the 
mfection. This local response often triggers non-specific resistance throughout the plant, 

15 a phenomenon known as systemic acquired resistance (SAR). Once triggered, SAR 
provides resistance to a wide range of pathogens for days. The HR and SAR d^end on 
uiteraction between a dominant or semidominant resistance gene (R) product in the 
plant and a corresponding dominant phytopathogen avirulence gene (Avr) product 
(Baker et aL, 1997). A loss or alteration to either the plant R gene or the pathogen Avr 

20 gene leads to disease (con^atibility) (Hammond-Kosack and Jones, 1997), 

[00071 The R proteins provide resistance to pathogens as diverse as fungi, bacteria, 
viruses, nematodes and insects. Ei^t classes of R genes have been defined according to 
ttie structural characteristics of their predicted protein: (1) Cytoplasmic toxin reductase 
enzymes; (2) intracellular protein kinases; (3) receptor kinase-like protein with two 

25 tandem protein kinase domain; (4) receptor-like protein kinases with an extracellular 
leucine-rich repeat (LRR) domain; (5) intracellular LRR proteins with a nucleotide 
binding site (MBS) and leucine zipper (LZ) motif; (6) intracellular NBS-LRR proteins 
with a le^on with similarity to the Toll and interleukin-1 receptor (TIR) proteins; (7) 
LRR protems that encode membrane-bound exfaracellular proteins; and (8) LZ proteins 

30 that encode membrane-bound intracellular proteins (Figure 1). With a few exceptions, 
all R genes have been cloned by a map-based cloning approach. 

[OOOSJ The NBS'LRR class is by far the largest groiq> of resistance protems with 
more tfian 30 cloned genes to date. Two subgroups within the NBS-LRR class have 
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been recognised by the presence or absence of an amino N-terminal region (TLR 
domain) with amino acid sequence similarity to the cytoplasmic signalling domains of 
the Toll and interleu3dn-l recqitors (Meyer et al, 1999; Pan aL. 2000). 

[0009) The N-temunal of some MBS-IltR proteins is sunilar to the cytoplasmic 
5 effector domain of flie Drost^hila melanogaster and hunan TOLL and interleukin-1 
receptors (the TIR domain)(Hanimond4cosack and Jones, 1997). Other NBS-LRR 
proteins have different N-tenninal domams, which oaai contain putative leucine-zipper 
(LZ) motifs. Mutational analysis in Arabidopsis revealed that TIR-NBS-LRR and LZ- 
NBS-LRR proteins employ different signalling pathways. Ftoteins in the TIR effector 

10 domain signal via a pathway that includes EDSl, a predicted Kpase, whereas most LZ- 
NBS-LRR proteins exammed onploy flie membrane-associated M>R1 protein (Aaits et 
id.. 1998). Hiere is no ai^arent correlation between pattiogen type and the NBS-LRR 
subclass used by plants to detect these pathogens (Ellis and Jones 1998). All this 
evidence is consistent with the hypothesis of Aaits et al., (1998), who suggested that 

15 there may be two downstream pathways triggered by R genes, with the structure of die 
R protein detennining whidi downstream &ctors are required. Other recent results have 
shown that the situation may not be this simple. Two R genes fixjm Arabidopsis, RPP8 
and RPP13 (both LZ-NBS-LRR proteins), require neither EDSl nor NDRl, suggesting 
that there is at least a tfjird pathway for the traasduction of R-gene signals (Glazebiook, 
20 2001). Although many studies on diffwent R genes have suggested that the R-protem 
LRR domain makes the major contribution to the unique recognition capacity of 
individual R genes, recent analyses of the L allelic series has shown that the TIR 
domain can also conbibute to this capacity. Thus, it is possible that the LRR are 
necessary but not sufficient for the specific recognition of Avr protefaia and that LRR 
25 and amino-terminal domains have co-evolved to function in a coordinate manner. 
(Zachary, 2001). 

[0010] The central NBS domain comprises three moti& predicted to bind ATP or 
OTP, and several conserved motifs whose functions are not known (Hammond-Kosack 
and Jones, 1997). This region has homology to two activators of apoptosis in animal 
30 cells: APAF-l and CED. By analogy to these well-chaiacterised regulators of 
programmed ceU death, the corresponding domam in NBS-LRR proteins might operate 
as an intramolecular signal transducer (Van der Biraen and Jones, 1998; Aravind et al, 
1999). Domain sw^ involving several flax L alleles reveal a requirement for 
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intramolecular interactions and, thus, NBS-LRR proteins mi^t serve as ad2q)tor 
molecules that link recognition and signal delivery* For example, Avr signals perceived 
by the LRR mi^t initiate nucleotide hydrolysis at the NBS domain. This mi^t provide 
the energy necessary for a conformational change in the NBS-LRR protein, exposing ite 
5 N-tenninal effector portion, to trigger a defence response (Van der Biezen and Jones, 
1998), 

lOOll] LRR domain is thought to be involved in ligand-binding and pathogen 
recognition. LRR contain leucines or other hydrophobic residues at regular intervals and 
can also contain regularly spaced prolines and asparagines (Bent, 1996). Comparative 
10 analyses of the LRR domain shows hypervariability suggesting diversification due to 
selection pressures, Hiis indicates that recognition specificity resides in this part of the 
LRR. By anal>^es of in vivo and in vitro generated recombinants between difierent flax 
L alleles, Ellis et al (1997) confirmed experimentally that the LRR constitute the 
principal determinant of specificity for Avr products. Differential specificities of R 
15 proteins are often associated with duplications, deletions and sequence exchanges 
withm the regions that encode the LRR. Recently, flie LRR-like domain of the rice 
resistance protein Pita was shown to be required for interaction with Avr-Pita in the 
yeast two-hybrid systrai. Furthermore, mutation in either Avr-Pita Pita that abolished 
resistance also abolished the interaction in vitro. This is the first demonstrated 
20 interaction between an LRR-containing R protein and its cognate Avr protein (Jia et al., 
2000). 

[00121 Some of the r^istance genes isolated to date have been transferred to 
susceptible cultivars of die same species or different species wi A successful results. For 
example, the N gene for resistance to Tobacco mosaic virus (TMV) has been transferred 

25 to tomato and gives resistance in this species to TMV (Whitham et al., 1996). The Bs2 
gene, which encodes Xanthomonas resistance in p^per, has been cloned and transferred 
to tomato, a crop species in which the number of usefiil resistance genes to this 
pathogen is limited (Tai et al, 1999). However, die RPS2 gene fix)m Arabidopsis is 
non-functional in transgenic tomato and this phenommon has been re&ried to as 

30 'restricted taxonomic functionality' (Tai et al, 1999). These data suggest tfiat tibere may 
be difficulties in wide, cross-species resistance-gene transfer, in certain instances, due to 
R gene specificity (Ellis al, 2000). 
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[0013] The ability to isolate and transfer R genes eliminates the issue of retention 
of unwanted and genetically linked gennoplasm, an important problem associated with 
classical breeding. Although disease-resistance transgenic plants are no yet available 
commercially, future product development se^ns likely as our current level of 
5 understanding of pathogenesis and plant defence improves (Stuiver and Custers 2002). 

[0014] Despite the progress in R gene biology, however, no resistance genes have 
been isolated to date, which can confer resistance to destructive banana diseases in 
susceptible cultivars. 

[0015] In work leading up to the present invention, four genotypes of banana were 
10 investigated to identify candidate R genes that would confer resistance to race 4 of 
Fusarium oxysportan fsp cubense. These genotypes were as follows: Cavendish, which 
is resistant to race 1 but susceptible to race 4; Calcutta 4» which is resistant to race 1 and 
race 4; three progeny of Musa acuminata spp malaccensis, which are susceptible to race 
4; and three progeny of Musa acuminata spp malaccensis^ which are resistant to race 4. 
IS Five &miltes of R genes were identified from Oils investigation, all of which were 
present in the genomes of each of die genotypes but which had slightly diff^ent 
sequences. Suiprisingly, two of these fionilies (ROA2 and RGAS) were found to share 
some sequence similarity with the 12 R gene, whidi confers resistance to Fusarium wilt 
in tomatoes. Li addition RGA2 was shown to be transcribed in the three resistant Musa 
20 acuminata spp malaccensis progeny but not in ttie Ihree susceptible progeny. These 
discoveries have hem reduced to practice in compositions and methods for modulating 
disease resistance, especially fungal resistance, in plants including banana and in plants 
and plant parts, especially genetically modified plants, plant cells, tissues and seeds, 
having modified disease resistance, as described hereafter. 
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SUMMARY OF THE INVENTION 
[0016] Accordingly, in one aspect, the present invration provides isolated 
polynucleotides that confer disease resistance to a plant, especially resistance to 
diseases caused by fungal pathogens. These polynucleotides are generally selected 
S from: (a) a polynucleotide comprising a nucleotide sequence ttiat encodes a polypeptide 
conferring disease resistance to a plant, the sequence sharing at least 30% sequence 
identity with the sequence set forth in SEQ ID NO: 1 or 3, or a complement thereof; (b) 
a polynucleotide comprising a portion of at least 300 contiguous nucleotides of the 
sequence set forth in SEQ ID NO: 1 or 3, or a complement thereoj^ wherem fbs portion 

10 encodes a polypeptide that confers disease resistance to a plant; (c) a polynucleotide 
comprising a nucleotide sequmce that encodes a polypeptide comprising the amino acid 
sequence set forth m SEQ ID NO: 2 or 4; (d) a polynucleotide comprismg a nucleotide 
sequence that encodes a portion of a polypeptide comprising the amino acid sequence of 
SEQ JD NO: 2 or 4, wherein the portion retains the ability to confer disease resistance 

15 to a plant and con4)rises at least 100 contiguous amino acid residues of SEQ ID NO: 2 
or 4; (e) a polynucleotide comprising a nucleotide sequence that encodes, a polypeptide 
that shares at least S0% sequence similarity witti at least a portion of at least 300 
contiguous amino acid residues of tiie sequence set forth in SEQ ID NO: 2 or 4, wherein 
the polyp^tide confers disease resistance to a plant; (f) a polynucleotide comprising a 

20 nucleotide sequence that mcodes a polypeptide that confers disease resistance to a 
plant, wherein the polynucleotide hybridises to the sequmce of (a), (b), (c), (d), (e) or a 
conq>lement thereof under at least low stringency conditions; and (g) a polynucleotide 
comprising a portion of at least IS contiguous nucleotides of SEQ ID NO: 1 or 3, or a 
complement thereof wherein flie portion hybridises to a sequence of (a), (b), (c), (d), (e) 

25 or a complement thereof under at least low stringeaicy conditions. 

[0017] In another aspect, the present invention provides nucleic acid constructs for 
conferring disease resistance to a plant These constructs generally comprise a 
polynucleotide as broadly described operably connected to a regulatory element, which 
is operable in the plant. In certain embodiments, the construct is in the form of a vector, 
30 especially an expression vector, 

[0018] In yet another aspect, the present invention provides isolated host cells 
containing a nucleic acid construct or vector as broadly described above. In certain 
advantageous embodiments, the host cells are plant cells. In some embodiments, the 
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plant cells have the nucleic acid construct or expression vector stably incorporated into 
their nucleome, especially their genome. 

[0019] In still another aspect, the present invention provides plants containing a 
nucleic acid construct or expression vector as broadly described above. In certain 
S desirable embodiments, the plants have the nucleic acid construct or esqiression vector 
stably incorporated into the nucleome, especially, the genome of their cells. 

[0020] Tn a further aspect, the present invention provide probes for interrogating 
nucleic acid for the presrace of a disease resistance conferring polynucleotide or portion 
thereof. These probes generally comprise a nucleotide sequence that hybridises under at 

10 least low stringency conditions to a polynucleotide as broadly described above. In some 
embodiments, the probes consist essentially of a nucleic acid sequence which 
coiresponds or is complementary to at least a portion of a nucleotide sequence encoding 
the amino acid sequence set forth in SEQ ID NO: 2 or 4, wherein the portion is at least 
IS nucleotides in length. In other embodiments^ the probes comprise a nucleotide 

IS sequence which is capable of hybridising to at least a portion of a nucleotide sequmce 
encoding ftie amino acid sequence set forth in SEQ ID NO: 2 or 4 under at least low 
stringency conditions, wherein the portion is at least IS nucleotides in length. In still 
other embodiment, the probes comprise a nucleotide sequence that is capable of 
hybridising to at least a portion of SEQ ID NO: 1 or 3 under at least low stringency 

20 conditions, wherein the portion is at least 1 S nucleotides in lengOi, 

[0021] Another aspect of the present invention provides methods for modulating, 
especially stimulating or enhancing, disease resistance in a plant. These methods 
generally comprise introducing an expression construct or vector as broadly described 
above into the nucleome of the plant and regenerating stably transformed plants. In 

2S some embodiments, tiie expression construct or vector is introduced into regenerable 
plant cells so as to yield transformed plant cells, which are suitably identified and 
selected, and which are subsequently used for regen^ting differentiated plants . 
Typically, a transformed plant cell line is selected Sx>m the transformed plants cells for 
the differentiation of a genetically modified or transgmc plant In some embodiments, 

30 the reg^erable cells are regenerable dicotyledonous plant cells. In other embodiments, 
the re^nerable cells are regenerable monocotyledonous plant cells such as regenerable 
graminaceous monocotyledonous plant cells and especially regenerable non- 
graminaceous monocotyledonous plant cells. Jn one example, tiie regenerable plant cells 
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are legenerable banana cells. In certain advantageous embodiments, the expression of 
the nucleic acid construct renders the difierentiated transgraiic plant disease with 
enhanced resistance to disease particularly diseases caused by fungal pathogens* 
especially soil borne fungi such as Fusarium species. Desirably, the nucleic acid 
5 construct is transmitted ttiroug^ a complete cycle of the difTermtiated transgenic plant 
to its progeny so that it is expressed by the progeny plants. Thus> the invention also 
provides seed, other plant parts, tissue, and progeny plants derived from the 
diff^entiated transgenic plant. 

[0022] In still another aspect, the invention contemplates conventional plant 

10 breeding methods to transfer genetic material corre^ondhig to a polynucleotide as 
broadly described above via crossing and backcrossing to other plants, especially plants 
that are susceptible to a pathogenic disease, especially a disease caused by fungal 
pathogens such as species of Fusarium. In some embodiments, the genetic material will 
comprise naturally-occurring DNA that corresponds to a polynucleotide as broadly 

15 described above. Typically, fliese methods will comprise the steps of: (1) sexually 
crossing a plant containing that genetic materiial with a plant fiom a pathogen 
suscqptible taxon; (2) recovering reproductive material from the progeny of the ctoss; 
and (3) growing plants with enhanced resistance to the disease from the reproductive 
material, hi some embodiments, the methods will further comprise prior to stq> (1): 

20 identifying a plant that is resistent to the pafliog^iic disease by detecting expression by 
tiie plant of a polynucleotide as broadly described above. In certain advantageous 
embodimrats, these methods will further comprise the stq?s of repetitively: (a) 
backcrossing the disease resistant progeny with disease susceptible plants fix>m the 
suscq>tible taxon; and (b) selecting for expression of a nucleic acid sequence 

25 corresponding to a polynucleotide as broadly described above (or an associated marker 
gene) among the progeny of the backcross, until tfie desired percentage of the 
characteristics of the susceptible taxon are present in the progeny along with the gene or 
genes imparting the pathogen resistance. 

[0023] In another aspect of the invention, tti«re is provided isolated polyp^tides 
30 that confer disease resistance to a plant These polypeptides are generally selected from; 
(i) a polypq)tide comprising an amino acid sequence that shares at least 50% sequence 
snnilarity with tfie sequence set forth in SEQ DO NO: 2 or 4; (ii) a polypeptide 
comprising a portion of at least 100 contiguous amino acid residues of ttie sequence set 
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forth in SEQ ID NO: 2 or 4, wherein the polypeptide confer disease resistance to a 
plant; (iii) a polypeptide comprising an amino acid sequence that shares at least 30% 
similarity with at least a portion of 100 contiguous amino acid residues of the sequence 
set forth in SEQ ID NO: 2 or 4; and (iv) a polypeptide comprising a portion of at least 5 
5 contiguous amino acid residues of the sequence set forth in SEQ ID NO: 2 or 4, wherein 
the portion is immuno-interactive with an antigen-binding molecule that is immuno- 
interacdve with a sequence of (i), (ii) or Qii). 

[00241 En some embodiments, the polypeptide includes one or more and in some 
cases all of the following domains (the numbering refers to the consensus numbering in 
10 Figure 2): 

a domain which corresponds to residues 1-167 of Figure 2. This domain 
may be structurally similar to a coiled coil. In some embodiments, this domain can have 
at least 60, 70, 80, 90, 95, or 98 % sequence similarity with, or have at least 30, 40, 50, 
60, 70 or 80 % sequence identity to, or differ at no more than 1, 2, 3, 4, 5, 10, 15, 20, 
15 25, 30 or 40 amino acid residues from, the corresponding domain of any of the 
sequences presented in Figure 2; 

a domain which corresponds to residues 168-536 of Figure 2. This domain 
may be functionally analogous to a nuclear-binding site (NBS) domain. In some 
embodiments, this domain can have at least 70, 80, 90, 95, or 98 % sequence similarity 
20 with, or have at least 50, 60, 70, 80 or 90 % sequence identity to, or differ at no more 
than 1, 2, 3, 4, 5, 10, 15, 20, 25, 30 or 40 amino acid residues from, the corresponding 
domain of any of the sequences presented in Figure 2; and 

a domain which corresponds to residues 537-1476 of Figure 2. This domain 
may be functionally analogous to a leudne-rich r^eat (LRR) domain. In some 
25 embodiments, this domain can have at least 60, 70, 80, 90, 95, or 98 % sequence 
similarity with, or have at least 30, 40, 50, 60, 70, 80 or 90 % sequence identity to, or 
differ at no more flian 1, 2, 3, 4, 5, 10, 15, 20, 25, 30 or 40 amino acid residues from, ttie 
corresponding domain of any of the sequences presented in Figure 2. 

[0025] In some embodiments, it may be desirable to conserve one or more of the 
30 residues in the above regions, which are cons^ed between the sequences presented in 
Figure 2, wherein the conserved amino acid residues correspond to identical residues or 
to residues belonging to the same class or subclass of amino acid residues. 
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[00261 In some embodiments, the domain corresponding to residues 1-167 of 
Figure 2 conqirises a sequence according to Fonnula (I): 

Ser-<i>aai-<I>aa2 Zaa-Xaai-Oaa3-€>aa4 Xaaz-Baai Saai-Xaa3-Asn-Xaa4*-Xaa5-<t>aa5- 
Xaa6-Xaa7-Leu-Xaa8-Xaa9-Xaaio-Xaaii-Xaai2-Xaai3-Baa2-Xaai4-Aaai-Leu-Xaai5- 
5 Xaai6-I^u-Xaan-Xaai8-raa2-Leu-Leu-Arg-Xaai9'His-Saa3-Oaa6rLeu-Aaa2-QM^ 
Ala-Qaa2-2aa4-Arg-Xaa2(rXaa2rXaa22-Xaa23-Xaa24-Xaa25-Xaa26-Ser-Leu-Val- 
Xaa27-Oaa7-Oaa8-Xaa28-Xaa29-I^u-Lys-Aaa3-Xaa3<rAla-Tyr-Asp-Ala-Aaa4-Asp- 
Oaa9-Leu-Aaa5-Glu-<Pdaio-Glu-Xaa3rXaa32-Ala-Xaa33-Baa3-Xaa34-I'y 

wherein: each of Ouio is a hydrophobic amino acid residue, 
10 Zaa is a neutral/polar amino acid residue^ 

each of SaaM is a small amino acid residue, 

each of Baai.3 is a basic amino acid residue, 

each of Aaat.s is an acidic amino acid residue, 

each of Qaai.2 is a charged amino acid residue, and 
15 Xaai.33 each indqiendently selected finom any amino acid residue. 

[0027] In some embodiments, Zaa is selected &om Gin or Asn. 

[0028] In some embodiments, Oaai is selected fiom Phe or Leu. In some 
embodiments, cl>aa2 is selected from lie or Val. hi some embodiments, Oaas is selected 
from Leu or lie. hi some embodiments, Oaa^ is selected from Leu or Phe. In some 
20 embodiments, <I>aa5 is selected from lie or Val. In some raibodiments, Obb^ is selected 
from lie or Leu. In some embodiments, Oanj is selected from Leu or Tip* In some 
embodimmts, Oaas is selected from Val or Leu. In some embodiments, 4»aa9 is selected 
from Leu or He. In some embodiments, Oaaio is selected from Leu or Tip. 

[0029] In some embodimCTts, Zaai is selected from Ala Ser. In some embodiments, 
25 £aa2 is selected from Ser or Thr. In some embodiments, Saas is selected from Ala Ser. 
In some embodiments, £aa4 is selected from Thr or Ala. 

[0030] In some embodiments, Baai is selected from Lys or Arg. In some 
embodiments, Baa2 is selected from His or Arg. In some embodiments, Baa3 is selected 
from Lys or Arg. 

30 [0031 ] In some embodiments, each of Aaa^ is independently selected from Asp or 
Glu. 
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10032] In some embodiments, Qaai is selected firOTi Lys or Glu, In some 
embodimentSt Qaaa is selected from Glu or Ijys. 

[0033] In some embodiments Xaai is a small or acidic amino acid residue, e.g., 
Xaat is selected from Thr or Glu- La some embodiments, Xaa2 is an acidic or 
5 neutral/polar amino acid residue, e.g., Xaaa is selected from Asp or Asil Li some 
embodiments, Xa^ is a small or hydrophobic amino acid residue, e.g., Xaa3 is selected 
from Ser or lie. In some embodiments, Xaa4 is a neutral/polar or hydrophobic amino 
acid residue, e.g., Xaa4 is selected from Cys or Leu. In some embodiments, Xaas is a 
small or hydrophobic amino acid residue, e.g., Xaas is selected from Ala He, lii some 

10 embodiments, Xaag is a neutral/polar or small amino acid residue, e.g., Xaaa is selected 
from Gin or Ala. In some embodiments, Xbslj is a neutral/polar or acidic amino add 
residue, e.g., Xaoj is selected from Gin or Glu, In some embodiments, Xaag is a small or 
basic amino acid residue, e,g., Xaag is selected from Ala Arg. lii some embodiments, 
Xaa^ is a basic or hydrophobic amino add residue, e,g., Xaa9 is selected from Arg or 

15 Leu. 

[0034] In some embodiments, Xaaio is a basic or neutral/polar amino acid residue, 
e.g,, Xaaio is selected from Arg or Gta. In some embodiments, Xaan is a basic or 
hydrophobic amino acid residue, e.g., Xaan is selected from Arg or Leu. In some 
embodiments, Xaa^ is a small or neutral/potar amino acid residue, e,g., Xaan is 

20 selected &cm Arg or Gin. In some CT:sbodiments, Xaa^ is a hydrophobic or small amino 
acid residue, e.g., Xaau is selected from Leu or Ala. In some embodiments, Xaai4 is an 
acid or small amino acid residue, e.g., Xaan 'is selected from Asp or Ala, In some 
embodiments, Xaats is a basic or neutral/polar amino acid residue, e,g,, Xaais is 
selected from Arg or Asa In some embodiments, Xaai6 is a basic or neutral/polar amino 

25 acid residue, e.g., Xaai6 is selected from Arg or Asn. In some embodiments, Xaan is a 
basic or neutral/polar amino acid residue, e.g., Xaai? is selected from Arg or Gin. In 
some embodiments, Xaaig is a small or basic amino acid residue, e.g., Xaajs is selected 
from Thr or Arg. In some embodiments, Xaai9 is a hydrophobic or small amino acid 
residue, e.g., Xaai9 is selected from lie or Thr. 

30 10035] In some embodiments, Xaaao is a hydrophobic or basic amino add residue, 
e.g., Xaa2o is selected from Trp or Arg. In some embodiments, Xaaai is absent or is a 
neutral/polar amino acid residue, e.g., Asn. In some embodimoits, Xaa22 is a basic or 
hydrophobic amino acid residue, e.g., Xaa22 is selected from His or Met hi some 
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embodiments, Xaa23 is a basic or small amino acid residue, e.g., Xaaas is selected from 
Lys or Thr. In some embodiments, Xaa24 is a neutral^iolar or acidic amino acid residue, 
e.g., Xaa24 is selected fiom Asn or Asp. la some embodiments, Xaaas is a small or basic 
amino acid residue, e.g., Xaa2s is selected from Thr or Lys. In some embodiments, 
Xaa26 is an acidic or hydrophobic amino acid residue, e.g., Xaa26 is selected from Glu or 
Leu. In some embodiments, X27 is a basic or hydrophobic or amino acid residue, e.g., 
Xaa27 is selected from Arg or Met In some embodimmts, Xaaas is a neutral/polar or 
acidic amino acid residue, e.g., Xaaz^ is selected from Gin or Glu. In some 
embodiments, Xaa29 is a small or hydrophobic amino acid residue, e,g., Xaa29 is 
selected from Ala Tip. 

[0036] In some embodiments, Xaaao is a hydrophobic or small amino acid residue, 
e.g., Xaa3a is selected from Tyr or Ala. In some embodiments, Xaaai is a neutral/polar 
or small amino acid residue, e.g., Xaaai is selected from Gin or Ala. In some 
embodiments, Xaa32 is a small or hydrophobic amino acid residue, e.g., Xaa32 is 
selected from Ala He. In some embodiments, Xaasa is a neutral/polar or hydrophobic 
ammo acid residue, e.g., Xaass is selected from Gin or Leu. 

[0037] In some embodiments, the domain corresponding to residues 168-336 of 
Figure 2 comprises a sequence according to Formula (II): 

Arg-Xaai-Xaa2-Thr-Zaai-Ser-Oaai-I^-Thr-GIu-2aa2-Xaa3-Oaa2-Oaa3-Gl^^ 

Xaa4-Gln-Aaai-Baai-Glu-Xaa5-Oaa4-Oaa5-fiaai-Leu-Leu-Leu-Aaa2-Eaa3-5:aa4- 

Xaa6-Gly-Xaa7-Xaa«-Saa5-Phe-Saa6-Val-Oaa6-Pro-Oaa7-Val-Gly-^aag-Gly-^ 

Xaa^Gly-Lys-Thr-Thr-Leu-Laa7-Gln-I^u-Oaa9-a>aaio-Asn-Asp-Xaaio-A^ 

Xaai i-Xaai 2-Xaai3-Phe-Xaai4-Leu-Baa2-<I>aau-Tip-Val-Cys-Val-SCT-Asp-Xaai5- 

Phe-Xaai6-Val-Lys-Arg-Oaa,2-Thr-Baa3-Glu-ne-Xaai7-Glu-Xa^ 

Xaa20'Qaa2-Xaa2i-Xaa22-Asp-Xaa23-Xaa24-Asn-Leu-Xaa25-Xaa26-Leu-G 

Xaa28-Leu-Lys-Glu-Qaa3-ne-Xaa29-Eaa8-Xaa3<rXaa3i-Phe-Leu-Leu-V^ 

Aq)-Val-Trp-Xaa32-Glu-Xaa33-Xaa34-Xaa35-fiaa4-Tip-Glu-Xaa36-I^^ 

Pro-i:^-naa5-Xaa38-X:aaa-Saaio-Arg-<31y-Ser-Xaa39-Val-n^ 

Xaa4i-Xaai2-Lys-<>aai3-Ala-Xaa43-^aai4-Xaa44'<jly-Thr-Met-n^ 

Xaa4fi-LeU'Aaa3-Xaa(7-Leu-Xaa48-Aaa4-Asp-Xaa49-Xaa5o-T]p-Xaa5i-Leu- 

naa7-Xaa52-Xaas3-2aaii-Phe-Xaa54-Xaa55-Xaas6-Xaa5r-Xaasg-2aai2-Xaa5^Xaa« 

Xaaai-Xaafi2-naa8-<I>aai7-Glu*Xaa63-ne-01y-Arg-Lys-ne-Ala-Xaa64-Lys-Oaai8- 
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Xaafi5-Gly-Xaa66-Pro-Oaai9-2aai3-Ala-Xaad7-Saai4-Oa^^^^ 
Arg-Xaa6irnaa9-Xaa7(r2aai5-Xaa7i-Xaa72-Xaa73-Trp-Aig.^^ 
Glu-2aai6-Glu-Xaa75-Tip-Xaa76-Oaa24-I^-Xaa77-Ala-Xaa78-Xaa7r^ 
l^u-Saai7-Xaa8o-Leu-Xaa8i-Xaa82-5cr-Tyr-Xaa83-XaaM-I^-Pn)-^ 
5 l^u-Baa4-Xaa86-Cys-Phe-Ala-Phe<;ys-Ala-^>aa26-Phe-Xaa87-Lys-Xaa8s- 
Xaa8rPh©-Xaa9(rLys-Qaaio-Xaa9rI^u-ne-Xaa92-Xaa93-Trp-De-Ala-X^ 

Oaa27-ne 

wherein: each of <Pi.27 is a hydrophobic amino acid residue, 
each of Saai.i$ is a smaH amino acid residue, 
10 each of Baai^ is a basic amino acid residue^ 

each of Aaai.s is an acidic amino acid residue, 
each of Qaai.io is a charged amino acid residue, and 
Xaai-M are eadi independently selected from any amino acid residue. 

[0038] In some embodiments, Saai is selected from Set or Hir. hi some 
15 embodiments, Saaa is selected from Thr or Ser. hi some embodimCTits, Saas is selected 
from Ser or Pro. In some embodiments, Saat is selected from Gly or Ser. hi some 
embodiments, Saas is selected from Ser or Ala. hi some embodunents, Saag is selected 
from Ser or Pro. In some embodiments, 2aa7 is selected from Ala or Ser. hi some 
embodiments, Saa« is selected from Ser or Gly. hi some embodiments, Saag is selected 
20 from Ala or Gly. 

[00391 some embodiments, Saaio is selected from Ala or Gly. hi some 

embodiments, 2aan is selected from Ala or Ser. hi some embodiments, Saan is selected 
fiom Pro or Ser, In some embodnnents, Saaia is selected from Ala or Gly. hi some 
embodiments, £aau is selected from Thr or Ala. In some embodiments, Saais is 
25 selected fiom Ser or Gly. In some embodunents, Saaie is selected from Ser or Thr. In 
some embodunents, Saan is selected from Pro or Ser. In some embodiments, Saais is 
selected fiom Gly or Pro. 

[0040] In some embodiments, <I>aai is selected fiom Phe or Leu. hi some 
embodiments, 3>aa2 is selected from Val or He. hi some embodiments, Oaas is selected 
30 from Phe or Val. hi some embodiments, <I>aa4 is selected from Val or Leu. In some 
embodiments, Oaas is selected from Val or lie. ha some embodunents, <Daa6 is selected 
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from Leu or Val. In some embodimCTts, Oaa? is selected from Lea or He. Tn some 
embodimmtSy Oaa^ is selected from lie or Val. In some embodiments, Osag is selected 
. from Val or He. 

[00411 In some embodiments, Oaaio is selected from Tyr or Phe. In some 
5 embodiments, Oaau is selected from Val or Met. In some embodiments, Oaaiz is 
selected from Leu or He. In some embodiments, Oaai3 is selected from lie or VaL In 
some embodiments, <l>aai4 is selected from lie or Val. In some embodiments, Oaais is 
selected from lie or Tyr, In some embodiments, Oaaie is selected from Phe or He. In 
some embodiments, Oaan is selected from Leu or Met In some embodiments, <I>aai8 is 
10 selected &om Leu or lie. In some embodimmts, (I>aai9 is selected from Leu or Tyr. 

(0042] In some embodiments, Oaa2o is selected from Leu or Met. In some 
embodiments, Oa£^] is selected from Leu or Tyr. In some embodiments, Oaa22 is 
selected from He or Val. In some embodiments, 4>aa23 is selected from Met or Leu. In 
some embodiments, <E>aa24 is selected from Leu or Met In some embodiments, Oaaas is 
IS selected from He or Val. In some onbodiments, <Pasi26 is selected from Val or Leu. In 
some embodiments, (I>aa27 is selected from Phe or Leu. 

[0043] In some embodiments, BaaM are each independently selected from Arg or 
Lys. 

[0044] In some embodiments, each of Aaai.5 is independently selected from Asp or 
20 Glu, 

[0045] In some embodiments, £2aai is selected from Glu or Arg. In some 
embodiments, CIbbz is selected from Glu or Arg. In some embodiments, f2aa3 is selected 
from Lys or Glu. In some embodiments, £2aa4 is selected from Asp or Lys. In some 
embodiments, Qaas is selected from Arg or Asp. la some embodiments, £2aa6 is selected 
25 from Lys or Glu. In' some CTibodiments, Qaa? is selected from Lys or Glu. In some 
embodiments, £2aag is selected from Glu or Arg. In some embodiments, Qaa9 is selected 
from Asp or Lys. In some embodiments, Qaa^o is selected from His or Asp. 

[0046] In some embodiments Xaai is a basic or small amino acid residue, e.g., Xaai 
is selected from Arg or Gly. In some embodiments, Xaaz is an acidic or hydrophobic 
30 amino acid residue, e.g., Xaaz is selected from Glu or Val. In some embodiments, Xaaj 
is a hydrophobic or neutral/polar amino acid residue, e.g., Xaaa is selected from Val or 
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Cys. In some embodiments, Xaai is an acidic or small amino add residue, e.g., Xad4 is 
selected from Asp or Ala. In some embodiments, Xaa^ is a basic or neutral/polar amino 
acid residue, e.g., Xaas is selected from Lys or Asn. In some embodimrats, Xaoe is a 
small or acidic amino acid residue^ e.g., Xaa6 is selected from Ser or Asp. In some 
S embodimaots, Xaa? is absent or is a neutral/polar amino acid residue, e.g., Asn. In some 
embodiments, Xaas is absent or is a small amino acid residue, e.g., Ser. In some 
embodiments, Xaa9 is a hydrophobic or small amino acid residue, e.g., Xaa$ is selected 
from Valor Ala. 

[0047] Li some embodiments, Xaaio is a neutral/polar or basic amino acid residue, 
10 e-g., Xaaio is selected from Asn or lys. In some embodiments, Xaai i is a small or acidic 
amino acid residue, e.g., Xaan is selected from Gly or Glu. In some embodiments, 
Xaai2 is a neutral/polar or acidic amino acid residue, e.g., Xaau is selected from Asn or 
Glu. In some embodiments, Xaaia is a hydrophobic or basic amino acid residue, e.g., 
Xaa]3 is selected from Tyr or His. In some embodiments, Xaai4 is a basic or small 
IS amino acid residue, e.g., Xaau is selected from His or Pro. In some embocUments, Xaai5 
is a neutral/polar or acidic amino acid residue, e.g., Xaai ^ is selected from Asn or Asp. 
In some embodiments, Xaa]6 is a neutral/polar or acidic amino acid residue, e.g., Xaai6 
is selected from Asn or Asp. &i some embodiments, Xaan is a hydrophobic or small 
amino acid residue, e.g., Xaan is selected from lie or Thr. In some embodiments, Xaais 
20 is a small or hydrophobic amino acid residue, e,g., Xa&is is selected from Ser or Tyr. In 
some embodiments, Xaai9 is a basic or neutral/polar amino acid residue, e.g., Xaaip is 
selected from Lys or Asn. 

[0048] In some embodiments, Xaazo is a hydrophobic or small amino acid residue, 
e.g., Xaa2o is selected from Val or Gly. In some embodiments, Xaa2i is a neutral^olar 

25 or hydrophobic amino acid residue, e.g., Xaaai is selected tcom Gin or Phe. In some 
embodiments, Xaa22 is a small or hydrophobic amino acid residue, e.g., Xaa22 is 
selected fiY>m Ser or Met. In some embodiments, Xaa23 is a basic or hydrophobic amino 
acid residue, e.g., Xaa23 is selected from Lys or Leu. In some embodiments, Xaa24 is a 
hydrophobic or small amino acid residue, e.g,, Xaa24 is selected from Leu or Thr. In 

30 some embodiments, Xaa25 is an acidic or neutral/polar amino acid residue, e.g., Xaa25 is 
selected from Asp or Asn. In some embodiments, Xaa26 is a small or hydrophobic 
amino acid residue, e.g., Xaa26 is selected from Thr or Met. In some embodiments, 
Xaa27 is a nratral/polar or hydrophobic amino, add residue, e.g., Xaa27 is selected from 
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Gin or Val. In some embodiments^ Xaa28 is a hydrophobic or neutral/polar amino acid 
residue, e.g., Xaa28 is selected from lie or Asn. In some embodiments, Xaa29 is a small 
or basic amino add lesidue, e.g., Xaa29 is selected from Ala or Arg* 

[0049] In some embodiments, Xaaao is an acidic or small amino add residue, e.g., 
5 Xaaao is selected fix)m Glu or Thr. In some embodiments, X31 is a basic or small amino 
acid residue, e.g.» Xaajj is selected &om Arg or Tin:. In some embodiments, Xaa32 is a 
small or neutral/polar amino acid residue, e.g., Xaa32 is selected fiom Ser or Asn. In 
some embodiments, Xaa^s is a neutral/polar or acidic amino acid residue, e.g., Xaasa is 
selected &om Asn or Asp. In some embodiments, Xaa34 is a basic or small amino add 

10 residue, e.g., Xaa34 is selected from Aig or Pro. In some embodiments, Xaajs is an 
acidic or hydrophobic amino acid residue, e.g., Xaaas is selected from A^ or Val. In 
some embodiments, Xaasa is a basic or small amino acid residue, e.g., Xaajs is selected 
from Arg or Ser. In some embodiments, Xaas? is a neutral^olar or hydrophobic amino 
acid residue, e.g., Xaa37 is selected from Cys or Leu. In some embodiments, Xaasg is a 

IS hydrophobic or small amino add residue, 6.g., Xaasg is selected from Phe or Ala. In 
some embodiments, Xaa39 is a basic or hydrophobic amino acid residue, e.g., Xaasg is 
selected from Lys or VaL 

[0050] In some embodiments, Xaa4o is a basic or neutral^olar amino add residue, 
e.g., Xaa4o is selected from Arg or Gin* Li some embodiments, X41 is an addic or small 

20 amino acid residue, e.g., Xaa4i is selected from Asp or Ser. In some embodiments, 
Xaa42 is a small or basic amino acid residue, e.g., Xaa42 is selected from Thr or Lys. In 
some embodiments, Xaa43 is a small or acidic amino add residue, e.g., Xaa43 is selected 
from S&r or Asp. In some embodiments, Xaa44 is a hydrophobic or small amino acid 
residue, e.g., Xaa44 is selected from He or Thr. hi some embodiments, Xaa45 is an acidic 

25 or small amino add residue, e.g., Xaa45 is selected from Glu or Pro. In some 
embodimmts» Xaa^e is a small or hydrophobic amino acid residue, e.g., Xaa46 is 
selected from Ser or Val. In some embodiments, Xaa47 is a small or acidic amino acid 
residue, e.g., Xaa47 is selected from Gly or Glu. In some embodiments, Xaa48 is a 
neutral/polar or small amino acid residue, e.g., Xaa4g is selected from Gin or Thr. In 

30 some embodiments, Xaa49 is a small or acidic amino add residue, e.g., Xaa49 is selected 
from Ala or Asp. 

[0051] In some embodiments, Xaaso is a hydrophobic or small amino add residue^ 
e.g., Xaa^o is elected from Tyr or Ser. In some embodimmts, X51 is an acidic or small 
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amino acid residue, e.g., Xaasi is selected from Glu or S^. In some embodiments, Xaas2 
is a basic or small amino acid residue, e.g., Xaa52 is selected from Lys or Ser. In some 
embodiments, Xaasa is a neutral/polar or basic amino acid residue, e.g., Xaas3 is 
selected from Cys or His. In some embodiments, Xaa54 is a smaU or basic amino acid 
S residue, e.g., Xaa54 is selected from Gly or Arg. In some embodiments, Xaass is a small 
or acidic amino acid residue, e.g., Xaass is selected from Ser or Glu. In some 
embodiments, Xaa56 is a hydrophobic or small amino acid residue, e.g., Xaas^ is 
selected from Val or Ala. In some embodiments, Xaas? is a neutral/polar or small amino 
acid residue, e.g., Xaas? is selected from Asn or Ser. In some embodiments, Xaass is 
10 absent or is a neutral/polar amino acid residue, e.g., Cys. In some embodiments, Xaa59 
is a neutral/polar or small amino acid residue, ag., Xaas9 is selected from Ghi or Ser. 

[0052} hi some embodiments, X^o is an acidic or small amino acid residue, e.g., 
Xaa^o is selected from Glu or Pro. In some embodiments, X^i is a basic or neutral/polar 
amino acid residue, e.g., Xaaat is selected from IGis or Asn. In some embodiments, 

IS Xaa62 is a hydrophobic or small amino acid residue, 6.g., Xaa62 is selected from Leu or 
Pro. In some embodiments, Xaa^j is a hydrophobic or acidic amino acid residue, e.g., 
Xaa63 is selected from Val or Glu. In some embodimmts, Xaa64 is a small or basic 
amino acid residue, e.g., Xaa64 is selected from Gly or Lys. In some embodiments, 
Xaa^s is a basic or small amino acid residue, 6.g., Xaa^s is selected from Lys or Ser. In 

20 some embodiments, Xaa66 is a small or hydrophobic amino acid residue, e.g., Xaa66 is 
selected from Ser or Leu. In some embodiments, Xaa67 is a basic or small amino acid 
residue, e.g., Xaaa? is selected from Lys or Thr. In some embodiments, Xaaes is a small 
or basic amino acid residue, e.g., Xaa68 is selected from Ser or Arg. In some 
embodiments, Xaa^g is a hydrophobic or small amino acid residue, 6.g., Xaa^p is 

25 selected from Leu or Ser. 

[0053] In some embodiments, Xaayo is a hydrophobic or basic amino acid residue, 
6.g., XaaTo is selected from Val or His. In some embodiments, X71 is a neutral/polar or 
acidic amino acid residue, e.g., Xaaji is selected from Gin or Glu. in some 
embodiments, Xaa72 is an acidic or small amino acid residue, e.g., Xaa72 is selected 
30 from Glu or Ser. In some CTibodiments, Xaa73 is a basic or small amino acid residue, 
e;g., Xaa73 is selected from His or Ser, hi some embodiments, Xaa74 is a small or acidic 
amino acid residue, e.g., Xaa74 is selected from Thr or Glu. In some ^nbodunents, 
Xaa75 is a hydrophobic or small amino acid residue, e.g., Xaa7s is selected from Val or 
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Tbr. In some embodiments, Xa^e is a neutral/polar or acidic amino acid residue, e.g., 
Xaa76 is selected firom Gin or Glu« In some ^bodiments, Xaa?? is a neutral/polar or 
small amino add residue, e.g., Xaa?? is selected from Gin or Pro. In some ^bodiments, 
Xaa78 is an acidic or small amino acid residue, e.g., Xaa78 is selected fiom Glu or Ala. 
S In some embodiments, Xaayp is a neutral/polar or small amino acid residue, e.g., Xaa79 
is selected fiom Asn or Ser. 

[0054] In some embodiments, Xaaso is a hydrophobic or small amino acid residue, 
e.g., Xaaeo is selected fiom Val or Ala. In some embodiments, Xsi is a hydrophobic or 
basic amino acid residue, e.g., Xaasi is selected fiom Trp or Arg. In some embodiments, 

10 Xaa82 is a hydrophobic or basic amino acid residue, e.g., Xaaga is selected firom Leu or 
Arg. In some onbodiments, Xaags is a neutral/polar or acidic amino acid residue, e.g., 
Xad83 is selected fiom Gin or Asp. In some embodiments, Xaa84 is a basic or 
neutral/polar amino acid residue, e.g., Xaag4 is selected fix>m His or Asn. In some 
embodiments, Xaass is a basic or neutral/polar amino acid residue, e.g., Xaags is 

1 S selected fixmi His or Ghi. In some embodiments, Xad86 is a neutral/polar or hydrophobic 
amino acid residue, e.g., Xaase is selected &om Gin or Leu. In some ^nbodiments, 
Xaa87 is a basic or small amino acid residue, e.g., Xaag? is selected fix>m His or Thr. In 
some embodiments, Xaagg is an acidic or small amino acid residue, e.g., Xaags is 
selected from Asp or Gly. In some embodiments, Xaa89 is a hydrophobic or basic amino 

20 acid residue, e.g., Xaas9 is selected fiom Leu or Arg, 

[0055] In some ^bodiments, Xaa9o is a hydrophobic or basic anuno acid residue, 
e.g., Xaa9o is selected fiom Tyr or Arg. In some embodiments, Xaagi is an acidic or 
small amino acid residue, e.g., Xaa9i is selected from Glu or Thr. In some embodiments, 
Xaa92 is a neutral/polar or basic amino acid residue^ e.g., Xaa92 is selected from Ghi or 
25 His. In some embodiments, Xaaoa is a small or hydrophobic amino acid residue, e.g., 
Xaa93 is selected from Thr or Met. In some ^bodiments, Xaa94 is an acidic or 
neutral/polar amino add residue, e.g., Xaa94 is selected fix>m Glu or Gin. In some 
embodimmts, Xaa^s is a small or neutral/polar amino acid residue, e.g., Xaa^s is 
selected fix>m Gly or Asn. 

30 [0056] In some embodiments, the domain corresponding to residues S37-1476 of 
Figure 2 comprises a sequence according to Formula (II): 

l^u-Xaarflaaj-Xaa2-Oaai-Phe-Baai-Xaa3-Leu-Xaa4-Arg-Ile-Baa2-Val-Leu-Xaa5- 

<I>da2-Xaa^-Xaa7-Cys-Xaa8-<I>aa3*Baa3-Xaa9*Leu-Pro-Xaaio*Xaai i -a>aa4-Gly- 
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Xaaj2-Leu-Xaai3-Xaai4-I^u-AiB-TjT-I^uOCaai5-Oaa5-5CT-Xaai6-Am 

ne-Gln-Arg-Leu-Pio-Glu-Ser-Oaa6-XaaiB-I2aa2-I^^ 

I^u-Xaa2i-Ixiu-Xaa22-Gly-^ys-Xaa23-I^-Xaa24-Xaa25-^aa7*Pr^ 

Met-Ser-Baa4-I^u-Oaa8'Xaa27-Leu-Arg-Gln-I^ii-Baa5-Xaa28-Xaa29-^^ 

Oaa9-Ile-2aa4-Qaa3-ne-Xaa3rQaa4-Val-Gly-Baa6-I^u-ne-Xaa32-I^u-Gte 

I^-Xaa33-Ala-Oaaio-Xaa34-Val-Xaa35-Xaa36-Baa7-Xaa37-Gly-Xaa38-Xaa39-a>M 

Ala-Glu-Leu-Ser-i;aa5-Oaai2-Xaa4o-Gln-Leu-Baa8-2aa6-Xaa4rI^u-Xaa42-^^ 

Xaa43-ACT-Leu-XaaM-Asn-Val-Xaa45-Xaa46-Xaa47-f2aa5-Glu-2aa7-Xaa45-Lys- 

Ala-Baa9-Leu-naa6-Oaa7-Lys-Gln-Xaa49-Leu-flaa8-Xaa5o-Leu-Aaa2-Leu-Qaa9- 

TTi>-AIa-Xaa5i-GIy-Xaa52-Xaa53-Xaa54-Xaa55-Xaa56-Xaa57-Xaas8-Glii-Xaa59- 

Xaa«rXaa6rXaa62-^^aio-£2aan-Val-Lcu-Xaa63-Gly-I^u-Xaa64-Pro-His-Xaa65- 

Xaa66-Leu-Baaio-Xaa67-Leu-2aa8-Ile-Baai i-Xaa68-Tyr-Saa9-Gly-Saaio-Saai r 

Xaa«9-Pro-Ser-Tii>-<I>aai3-Xaa7o-Xaa7i-Xaa72-Oaai4-Leu-Pro-Asn^ 

Thr-Oaai6-Baai2-Leu-naai2-Xaa74-Cy5-2aai2-Arg-L^u-Xaa75-Xaa76-^^ 

Xaa77-Oaai7-Qly-<jln-Lcu-Xaa78-Xaa79-Lcu-Baai3-Xaa8o-I^u-His-^ 

3toa«i-M€t-Saai4-Xaa82-Val-Baai4<jln-Oaai9-Xaa83-Xaa84-Xaa85-Oaa^ 

Gly-Xaa«7-2aai5-naai4-Xaa88-Xaa89-Xaa9(rPh©-Pn>-Xaa9i-Leu-Glu^ 

Xaa93-Oaa2rnaai5-naai6-Met-Pro-Saai6-Leu-ilaai7-Glu-*aa22 

wherein: each of O1.22 is a hydrophobic amino acid residue, 
each of Saat.i6 is a small amino acid residue, 
each of Baai.i4 is a basic amino acid residue, 
each of Aaai.2 is an acidic amino acid residue^ 
each of naai.i6 is a charged amino acid residue, and 
Xaai.93 are each independently selected fiom any amino acid residue. 

[00571 In some embodim^ts, £2aai is selected from His or Asp. In some 
embodiments, ^^aa2 is selected from Asp or Arg. In some embodiments, Qaas is selected 
from Lys or Asp. In some embodiments, Qaa4 is selected from Glu or Lys. In some 
embodiments, Qaas is selected from Glu or Arg. In some embodiments, Claa^ is selected 
from His or Asp. In some embodimrats, Qaa? is selected from Arg or Glu. In some 
embodiments, Qaag is select from Glu or Lys. In some embodiments, naa9 is selected 
from Glu or Arg. 
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[0058] In some embodiments, Qaaio is selected fiom Glu or Arg. In some 
embodiments, Qaau is selected fiom Glu or Lys. In some embodiments, Qazn is 
selected jfirora Lys or Asp. In some embodiments, Qaan is selected fiom Lys or Glu, In 
some embodiments, Qaai4 is selected fiom Lys or Glu. In some embodiments, Haais is 
5 selected fiom Glu or Arg. In some embodiments, Qaai6 is selected fiom Asp or Arg. 

(0059J In some embodiments, cpaai is selected from Leu or Met In some 
embodiments, <[>aa2 is selected from Leu or Phe, In some embodiments, <I>aa3 is selected 
Scorn Met or lie* lii some embodiments, C>bz^ is selected fiom lie or Val. In some 
embodiments, Oaas is selected fiidm Be or Leu. In some embodiments^ C>aa6 is selected 
10 fiom Leu or Val. In some embodiments, Oaa7 is selected fiom Phe or Leu, In some 
embodiments, (Daaa is selected fiom Ue or Leu. In some embodiments, Oaa9 is selected 
fiom lie or Val. 

[0060] In some embodiments, Oaaw is selected fi:om Phe or Tyr. In some 
embodimmts, *aaii is selected fiom Leu or He. In some embodiments, Oaai2 is 
15 selected from Leu or Met. In some embodiments, <l)aai3 is selected fiom Leu or Met. In 
some embodiments* Oaau is selected fiom Met or Tyr. In some embodiments, Oaais is 
selected fiom Leu or Met. In some embodiments, <Daai6 is selected fiom Leu or lie. In 
some embodiments, Oaa^ is selected from lie or Leu. In some embodiments, Oaaig is 
selected from Met or lie. In some embodiments, Oaat9 is selected from Met or He. 

20 [0061] In some embodiments, <I>aa2o is selected fi^m Leu or Phe. Jn some 
embodiments, Oaa2i is selected fiom Leu or lie. In some embodiments, Oaa22 is 
selected fix>m Phe or Trp. 

[0062] In some embodiments, Baai, Baaa.^ and Baa^.^ are each independently 
selected fiom Aig or Lys. In some ^bodiments, Baa2 and Baas are each independently 
25 selected fiDm His or Arg. In some embodiments, Baa2 is selected from His or Lys. 

[0063] In some embodiments, 2aai is selected Sx)m Ala, or Thr. In some 
embodiments, Saai is selected fix>m Ala or Thr. In some embodiments, Saaa is selected 
from Gly or Sct. In some embodiments, Saa4 is selected from Sct or Ala. In some 
embodiments, Eaa^ is selected from Gly or Ala. In some embodiments, Eaa^ is selected 
30 fix)m Gly or Ser. In some embodiments, Zaa? is selected from Ala or Ser. In some 
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embodiments, Saag is selected fiom Thr or Ser. In some embodiments, Saa9 is selected 
fiom Ser or Giy. 

[0064] In some embodiments, Saaio is selected &om Ala or Thr, In some 
embodiments, Zaan is selected from Thr or Ser. In some embodiments, £aai2 is selected 
5 fix)m Thr or Ala. In some embodiments, Saaia is selected from Ser or Pro. In some 
embodiments, £aai4 is selected fiom Pro or Ser. In some embodiments, Saais is selected 
fiom Thr or Gly. In some embodiments, Eaai^ is selected fix>m Thr or Ser. 

[0065] Jn some embodiments, each of Aaa].2 is ind^endratly selected firom Asp or 
Glu, 

10 [0066] In some embodimmts Xaai is a small or hydrophobic amino acid residue, 
e.g., Xaat is selected fiom Pro or Leu. In some embodiments, Xaaa is an small or basic 
amino acid residue, 6.g., Xaa2 is selected from Ser or Arg. Li some embodiments, Xaas 
is a basic or hydrophobic ammo acid residue, 6.g., Xaa^ is selected fiom Arg or Met. In 
some embodiments, Xaa4 is an basic or small amino acid residue, e.g., Xaa4 is selected 

15 fit>m Lys or Ser. In some embodiments, Xaas is a hydrophobic or acidic amino acid 
residue, e.g., Xaas is selected fiom Val or Asp. In some embodiments, Xaa6 is a 
neutral/polar or small amino acid residue, e.g., Xaa^ is selected from Gin or Ser. hi 
some embodiments, Xaa7 is a basic or neutral4>olar amino acid residue, e.g., Xaa? is 
selected from Lys or Asn. In some embodiments, Xaaa is a small or hydrophobic amino 

20 acid residue, 6.g., Xaas is selected from Gly or Val. In some embodiments, Xaa9 is an 
acidic or neutral/polar amino acid residue, e.g., Xaag is selected fix>m Glu or Asn. 

[0067] In some embodiments, Xaaio is an acidic or small amino acid residue, e.g., 
Xaaio is selected fiom Asp or Ser. In some embodiments^ Xaan is a hydrophobic or 
small amino acid residue, e.g., Xaan is selected fiom He or Ser. Jn some embodiments* 

25 Xaan is an acidic or neutral/polar ammo acid residue, e.g., Xaai2 is selected &om Asp 
or Asn. In some embodimmts, Xaai3 is a hydrophobic or basic amino acid residue, e.g., 
Xaai3 is selected fiom He or Lys. In some embodiments, Xaai4 is a neutral4>olar or 
basic amino add residue, e.g., Xaan is selected fiom Gin or His. In some embodiments, 
Xaais is acidic or small amino acid r^due, e.g., Xaais is selected &om Asp or Gly. hi 

30 some embodiments, Xaai6 is absmt or is a hydrophobic amino acid residues, e.g., Tyr. Li 
some embodiments, Xaa^ is a neutral/polar or basic amino acid residue, e.g., Xaan is 
selected from Cys or Arg. In some embodiments, Xaais is a neutral/polar or small 
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amino acid residue, e.g., Xaaig is selected fiom Cys or Thr. In some embodiments, 
Xaai9 is a hydrophobic or neutral^olar amino acid residue, e.g., Xaai9 is selected fiom 
TyrorCys. 

{00681 In some embodunents, Xaa2o is a neutral/polar or hydrophobic amino acid 
S r^idue, e.g., Xaazo is selected fiom Asn or Leu. In some embodiments, Xaaat is a basic 
or hydrophobic amino acid residue, e.g., Xaaai is selected from Arg or Leu. In some 
embodiments, Xaa22 is a hydrophobic or acidic amino acid residue, e.g., Xaa22 is 
selected fiom Tip or Glu. In some embodiments, Xaa23 is a neutral/polar or acidic 
amino acid residue, e.g., Xaa23 is selected fiom Gin or Glu. In some embodimrats, 
10 Xaa24 is a basic or neutral/polar amino acid residue, 6.g., Xaa24 is selected fi*om Arg or 
Cys. In some embodiments, Xaa2s is a small or basic amino acid residue, e.g., Xaa2s is 
selected fiom Ser or Arg. In some embodiments, Xaaae is a neutral/polar or basic amino 
acid residue, e.g., Xaa26 is selected fix>m Ghi or Arg. In some embodiments, Xaa27 is a 
neutral^olar or basic amino acid residue, e.g., Xaa27 is selected fiom Asn or Lys. In 
IS some embodiments, Xaa28 is a hydrophobic or small amino acid residue, e.g., Xaazs is 
selected fiom Val or Ala. hi some embodiments, Xaa29 is an acidic or neutral/polar 
amino acid residue, e.g., Xaaaj) is selected fiom Glu or Asn. 

[00691 In some embodiments, Xaa3o is an acidic or small amino acid residue, e.g., 
Xaaao is selected fiom Asp or Pro. Li some embodiments, X31 is a hydrophobic or small 

20 amino acid residue, e.g., Xaaai is selected 60m Tyr or Ala. In some embodiments, 
Xaa32 is a small or acidic amino acid residue, e.g*, Xaa32 is selected fix>m Ser or Glu. In 
some embodiments, Xas^s is a small or basic amino acid residue, e.g., Xaaaa is selected 
60m Ser or Lys. In some CTOtbodiments, Xaa34 is a basic or neutral/polar amino acid 
residue, e.g., Xaa34 is selected fiom Lys or Asn. In some embodiments, Xaaas is an 

25 hydrophobic or acidic amino acid residue, e.g., Xaa^s is selected fiom L«i or Asp. In 
some ^bodiments, Xaaae is a neutral/polar or basic amino acid residue, e.g., Xaa36 is 
selected fiom Asn or His. In some ^bodiments, Xaas? is a neutral/polar or basic amino 
acid residue, e.g., Xaas? is selected fiom Asn or Lys. In some embodiments, Xaa38 is a 
neutral/polar or basic amino acid r^idue, e.g., Xaass is selected from Asn or Lys. hi 

30 some cmbodunents, Xaa39 is a basic or small amino acid residue, e.g., Xa^9 is selected 
fix>mLysorGly. 

[0070] In some embodiments, Xaa4o is a small or neutral^lar amino acid residue, 
e.g., Xaa4o is selected fiom Thr or Asn. hi some OTibodunents, X41 is an small or acidic 
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amino acid residue, e.g., Xaa4i is selected fiom Thr or Asp. In some embodiments. 
Xaa42 is a basic or smaU amino acid residue, e.g., Xaa42 is selected fiom Arg or Ser. In 
some embodiments, Xaa43 is a smaU or basic amino acid residue, eg., Xaa43 is selected 
from Thr or Arg. In some embodiments, Xaa44 is an acidic or neutral/polar amino acid 
5 residue, e.g., Xaa44 is selected fiom Glu or Gin. In some embodiments, Xaais is a small 
or acidic amino add residue. e.g., Xaa45 is selected fiom Gly or Glu. In some 
embodiments, Xaa46 is a small or basic amino acid residue, e.g., Xaa^s is selected &ma 
Ser or Lys. In some embodiments, Xaa^ is a basic or smaU amino acid residue, eg., 
Xaa47 is selected fiom Lys or Thr. In some embodiments, Xaa48 is a small or basic 
10 amino acid residue, e.g,, Xaa4B is selected fiom Ser or Aig. In some embodiments, 
Xaa49 is a hydrophobic or basic amino acid residue. e.g., Xaa49 is selected fiom Tyr or 
Lys. 

(00711 In some onbodimoits, Xaoso is a small or hydrophobic amino acid residue, 
e.g., Xaaso is selected &om Ala or Leu. In some embodiments, X51 is a hydrophobic or 

15 acidic amino acid residue, e.g., Xaasi is selected fiom Ala or Asp. In some 
embodiments, Xaa52 is a neutoal/polar or basic amino acid residue, e.g., Xaa32 is 
selected fiom Gin or Arg. In some embodiments, Xaasa is a hydrophobic or smaU amino 
acid residue, e.g.. Xaass is selected fiom Val or Gly. In some embodimwits, Xaas4 is 
absent or is a small amino acid residue, e.g., Ser. In some embodiments, Xa&ss is absent 

20 or is a small amino acid residue, e.g., Ser. In some embodiments, Xaaje is absem or is a 
hydrophobic amino add residue e,g.. Leu, In some embodiments, Xaaa? is an addic or 
hydrophobic amino add residue, e.g., Xaas? is selected fix>m Glu or Ala. hi some 
embodiments, Xaass is a basic or small amino add residue, e.g., Xaass is selected fiom 
His or Gly. In some embodiments, Xaas9 is a hydrophobic or neutral^lar amino add 

25 residue, e.g., Xaa59 is selected fiom Leu or Cys. 

[0072] In some embodiments, Xaaeo is a hydn^hobic or acidic amino acid residue, 
e.g., Xaa«o is sdected from Leu or Asp. Ih some embodiments, Xei is a hydrophobic or 
basic amino acid residue, e.g., Xaa^i is selected from Val or Arg. In some embodiments, 
Xaa62 is a small or addic amino add residue, e.g., Xaa<a is selected fiom Ser or Asp. In 
30 some embodiments, Xaaea is a hydrophobic or basic amino acid residue, e.g., Xaaes is 
sdeeted Siom Leu or Lys. In some embodiments, Xaa^ is a neutral/polar or basic amino 
add redduB, e.g„ Xaa64 is selected firom Ghi or Aig. In some embodiments, Xaaas is a 
basic or small amino acid residue, e.g,, Xaa«s is selected from His or Pro. In some 
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embodiments, Xaa^ is a hydrophobic or neutral/polar amino acid residue, e.g., Xaaee is 
selected fiom Phe or Asn. In some embodiments, Xaa^i is a small or acidic amiao acid 
residue, e.g., Xaae? is selected from So: or Glu. In some embodiments. Xms is a small 
or hydrophobic amino acid residue, e.g., Xaa«8 is selected from Gly or Tyr. In some 
5 embodiments, Xaa^ is a hydrophobic or smaU amino add residue, e.g., Xaa» is 
selected fiom Val or Ser. 

[00731 In sbme embodiments, Xaa/o is an acidic or small amino acid residue, e,g,, 
Xaaro is selected from Asp or Thr. In some onbodiments, X71 is a hydrophobic or acidic 
amino acid residue. e.g., Xaa7i is selected fiom Val or Asp. In some embodunents, 

10 Xaa72 is a basic or neutral/J^olar amino acid residue, e.g., Xaa72 is selected fiom Lys or 
Gta. In some embodiments, Xaan is a small or addic amino acid residue, e.g., X&an is 
selected fiom Gly or Glu. In some embodhnents, Xaa74 is a neutral/polar or small amino 
add residue, e.g., Xaa74 is selected fiom Asn or Ser. In some embodiments, Xaa75 is an 
addic or smaU amino acid residue^ e,g., Xaa75 is selected firom Glu or Thr. In some 

15 embodiments, Xaa76 is a small or addic amino add residue, e.g., Xaa76 is selected fiom 
Gly or Glu. In some embodiments, Xaa77 is a hydrophobic or neutral/polar amnio add 
residue, e.g., Xaa77 is selected firan Tyr or Qys. fa some embodiments, XaaTs is a 
hydrophobic or basic amino acid residue^ e.g., XaaTs is selected fiom Phe or IDs. In 
some embodiments, Xaa?* is a basic or hydrophobic amino add residue, e.g., Xaa79 is 
20 selected fiom His or He. 

[0074] In some embodunaits, Xai^ is a hydrophobic or basic amino acid residue, 
e.g., Xaaso is selected fiom Val or HBs. In some embodiments, Xgi is a smaU or 
hydrophobic amino add residue, e.g., Xaasi is selected fiom Aig or Gly. Ih some 
embodiments, Xaa«2 is a hydrophobic or neutral/^lar amino acid residue, e.g., Xaaga is 

25 seleded fiom Val or Ghi. fa some «nbodunaits, Xaags is a small or naitraiy^lar ammo 
add residue, e.g., Xaaga is selected fiom Ser or Asn. fa some ^bodunents, Xaas4 is a 
bade or hydrophobic amino acid residne, e.g., Xaa^ is selected fiom His or Leu. fa 
some embodiments, Xaags is a neutral/polar or addic amino add residue, e.g., Xaass is 
selected fiom Gfa or Ghi. fa some anbodimoits, Xaoge is a neutral^lar or 

30 hydrophobic amino add residue, e.g., Xaaw is sdected fiom Cys or Tyr. fa some 
embodunraits, Xaa87 is a neutral/polar or small amino acid residue, e.g., Xaa87 is 
selected 60m Cys or Thr. fa some embodimoits, Xaa<8 is a smaU or hydrophobic amino 
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acid residne, e.g., Xaagg is selected from Ser or Val. In some embodiments, Xaagj is a 
basic or smaU amino acid residue, e.g., Xaa89 is selected fiom Lys or Ser. 

100751 In some embodiments, Xaago is a hydrophobic or smaU amino acid residue, 
e.g., Xaaso is selected from Leu or Gly. In some embodiments, Xaa9, is a basic or 
hydrophobic amino acid residue^ e.g., Xaa,, is selected from Arg or Leu. Ih some 
embodiments, Xaan is and acidic or hydrophobic amino acid residue, e.g., Xaasa is 
selected from Glu or Leu. Li some embodiments, Xaa93 is a hydrophobic or 
neutral^olar amino acid residue, e.g., Xaa93 is selected fiom Val or Asn. 

[00761 In yet another aspect, the invention provides antigen-binding molecules that 
are specifically immuno-interactive with a polypeptide or portion as broadly described 
above. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[00771 Figure 1 is a sdiematic representation of the location and structure of the 
eight main classes of plant disease resistance proteins. 

I0078J Figure 2 is a diagrammatic representation showing an aUgnment of the 
5 amino acid sequences set forth in SEQ ID NO: 2 and 4 using ClustalW multiple 
aKgnment and the PAM250 similarity matrix as disclosed for example by Dayhoff e/ al. 
(1978) A model of evolutionary change in proteins. Matrices for determining distance 
relationships In M. O. DayhoS; (ed.). Atlas of protein sequence and structure. Vol. 5, 
pp. 345-358. National Biomedical Research Foundation. Washington DC; and by 
10 Gonnet efa/., 1992, Science 256(5062): 144301445. 

10079] Figure 3 is a photogr^hic representation showing the migtation of 
amplification products on an agarose gel following an RT-PCR using primers specific 
for each banana NBS class using template RNA fiom M. acuminata ssp. malaccensis 
resistant (R) or susceptible (S) plants. Total RNA was extracted from leaf or root tissue 
and treated with DNAase. C+ lanes, expected ~ 480 bp actin 1 cDNA fhigment; C- 
lanes. no reverse transcriptase; AD, expected -580 bp actin 1 genomic DNA fragment 
with -100 bp intron included. 
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TABLEA 
Brief De scription of the Sequences 



SEQUENCE ID 
JN UMBER 


SEQUENCE 


LENGTH 


SEQIDNO: 1 
SEQIDNO:2 

SEQIDNO: 3 

SEQIDNO: 4 


Nucleotide sequence of RGA5 obtained fiom Calcutta 4 

Deduced amino acid sequoice encoded by SEO ID 
NO: 1 

Nucleotide sequence of RGA2 obtained fiom Musa 
acuminata spp malaccensis 

Deduced amino acid sequence encoded by SBQ ID 
NO: 1 


4380 nts 
1441 aa 

3699 nts 

1232 aa 
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DETAILED DESCRIPTION OF THE INVENTION 

i. Definitions 

I0080J Unless defined otherwise, all technical and scientific teims used hereinhave 
the same meaning as commonly understood by those of ordinary skiU in the art to whidi 
the invention belongs. Although any methods and materials similar or equivalent to 
those described herein can be used in the practice or testing of the present invention, 
preferred methods and materials are described. For the purposes of the present 
invention, the following tenns are defined below. 

[OOSIJ The articles «a" and are used herein to refer to one or to more than one 
(i.e.. to at least one) of the grammatical object of the article. By way of example, "an 
elCTienf* means one element or more than one element. 

[0082J By "about' is meant a quantity, level, value, frequency, percentage, 
dimension, size, amount, weight or lengQi that varies by as much as 30, 25. 20. 25. 10, 
9, 8, 7, 6, 5, 4, 3, 2 or 1 % to a reference quantity, level, value, frequency, percentage, 
15 dim^ion, size, amount, weight or lengdi. 

[00831 By "antigen-binding molecule" is meant a molecule that has binding affinity 
for a target antigen. It will be understood that this term extends to immunoglobulins, 
immunoglobulin fragments and non-immunoglobulin derived protein fiamewoiks that 
exhibit antigen-binding activity. 

20 [00841 As used herein, the term "binds specifically." "specifically immuno- 
interactive" and the Uke refers to antigen-binding molecules that bind or a immuno- 
interactive with the polypeptide or polypeptide portions of the invention but do not 
significantly bind to homologous prior art polypeptides. 

[0085] By "biologically active portion" is meant a portion of a fiiU-length parent 
25 peptide or polypeptide which portion retains an activity of the parent molecule. For 
example, a biologically active portion of polypeptide of the invention wiU retain flie 
ability to confer disease resistance, especially resistance to fungal pathogens such as 
Fusarium.- As used herein, the terai "biologically active portion" includes deletion 
mutants and p^tides, for example of at least about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 
30 18, 19, 20, 21, 22, 23, 24. 25, 30, 40, 50, 60, 70, 80, 90. 100. 120, 150. 300, 400. 500,' 
600, 700. 800, 900 or 1000 contiguous amino acids, which comprise an activity of a 
parent molecule. Portions of this type may be obtained through the application of 
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standaxd recombinant nucldc acid techniques or synthesised using conventional liquid 
or solid phase synthesis techniques* For example, reference may be made to solution 
syndesis or solid phase synfliesis as described, for exzmplt, in Chapter 9 entitled 
^'Peptide Synthesis" by Athaton and Shephard which is included in a publication 

5 entitled ^'Synthetic Vaccines'^ edited by Nicholson and publidied by Blackwell 
SciCTtific Publications. Alternatively, peptides can be produced by digestion of a 
peptide or polypeptide of the invention with proteinases such as endoLys-C, endoArg- 
C, endoGlu-C and staphylococcus V8-protease. The digested fragments can be purified 
by, for example, high performance liquid chromatographic (HPLC) techniques. 

1 0 Recombinant nucleic acid techniques can also be used to produce such portions. 

[00861 As used herein, the term ^ds-acting sequence**, ^'cis-acting element* or "cisr- 
regulatory region** or *^regulatory region^* or similar term shall be taken to mean any 
sequence of nucleotides, which when positioned appropriately relative to an expressible 
genetic sequence, is capable of regulating, at least in part, the expression of the genetic 

15 sequence. Those skilled in Ae ait will be aware that a cfa-regulatory region may be 
capable of activating, silencing, enhancing, repressing or otherwise altering the level of 
expression and/or cell-type-specificity and/or developmental specificity of a gene 
sequence at the transcriptional or post-transcriptional level. In certain embodiments of 
the present invention, the cfo-acting sequence is an activator sequence that enhances or 

20 stimulates the expression of an expressible genetic sequOTce.Throughout this 
specification, unless the context requites otherwise, the words "comprfec", ^'camprise^ 
and ^^comprislng"* will be understood to imply the inclusion of a stated step or element 
or group of steps or elements but not the exclusion of any other step or element or group 
of steps or elements. 

25 [0087] By ^'corresponds to** or ^'corresponding id* is meant a polynucleotide (a) 
having a nucleotide sequence that is substantially identical or complementary to all or a 
portion of a ref^ence polynucleotide sequence or (b) encoding an amino acid sequence 
identical to an amino acid sequence in a peptide or protem. This phrase also includes 
within its scope a peptide or polypeptide having an amino acid sequence that is 

30 substantially identical to a sequence of amino acids in a reference peptide or protein, 

[00881 As used herein, the terms "culturing^, ^'culture** and the like refer to the set 
of procedures used in vitro where a population of cells (or a single cell) is incubated 
under conditions which have been shown to support the growth or maintenance of the 

.29- 



P:\OpeiWpa\VPA PiovisioaalsNFusarlumprovisioimldDC 



25 September 2003 



cells in vitro. The art recognises a wide number of formats, media, tCTiperature rangps, 
gas cottcmtrations etc. which need to be defined in a culture system. The parameters 
will vary based on the format selected and the specific needs of the individual who 
practices the mefliods herein disclosed. However, it is recognised that the determination 
5 of culture parameters is routine in nature. 

[0089] By ^^disease resistance*^ is intended that plants avoid or suppr^s the disease 
symptoms that are the outcome of plant-pathogen interaction. That is, pathogens are 
prevented fi-om causing plant diseases and the associated disease symptoms. Hie 
methods of the invention can be utilised to protect plants bom disease, particularly 
10 those diseases that are caused by plant pathogens, such as Fusarium wilt. 

[0090] By ^"^expression vector^* is meant any autonomous genetic element capable 
of directing the transcription of a polynucleotide contained within the vector and 
suitably the synthesis of a peptide or polypeptide encoded by the polynucleotide. Such 
expression vectors are known to practition^ in the art. 

IS [0091] The term ^^gene'* as used herein refens to any and all discrete coding regions 
of the cell's genome, as well as associated non-coding and regulatory regions. The gme 
is also intended to mean the open reading frame encoding specific polypeptides, introns, 
and adjacent 5' and 3' non-coding nucleotide sequences involved in the regulation of 
expression. Li this regard, the gene may further comprise control signals such as 

20 promoters, enhancers, termination and/or polyadenylation signals that are naturally 
associated with a given gene, or heterologous control signals. The DNA sequences may 
be cDNA or genomic DNA or a fi^graent thereof The gene may be introduced into an 
appropriate vector for extrachromosomal maintenance or for integration into the host. 

[0092] The terms "growtf/iff" or '*regeneration" as used herein mean growmg a 
25 whole, difierentiated plant fixnn a plant cell, a group of plant cells, a plant part 
(includmg seeds), or a plant piece (e.g., fiom a protoplast, callus, or tissue part). 

[0093] ^'Hybridisation^* is used herein to denote the pairing of complementary 

nucleotide sequences to produce a DNA-DNA hybrid or a DNA-KNA hybrid 

Complementary base sequences are those sequences that are related by the base-pairing 

30 rules. In DNA, A pahs with T and C pairs with G. hi RNA U pairs with A and C pairs 

with G. In this regard, the terms '*match" and '"mismatch" as used herein ref^ to the 

hybridisation potralial of paired nucleotides in complementary nucleic acid strands. 

Matched nucleotides hybridise efBciently, such as the classical A-T and G-C base pair 
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mentioned above. Mismatches ate other combinations of nucleotides that do not 
hybridise efficiently. 

[0094] Reference herein to 'Hmmuno-interactive^' includes reference to any 
interaction, reaction, or other form of association between molecules and m particular 
5 where one of the molecules is, or mimics, a component of the immune system. 

[0095] By *'isolatect^ is meant material that is substantially or essentially free from 
components that normally accompany it in its native state. For example, an "isolated 
polynucleotide", as used hereto, refers to a polynucleotide, which has been purified 
from the sequences which flank it m a naturally-occurring state, e,g., a DNA fragment 
10 which has been removed from the sequences that are normally adjacent to the fragment. 
Alternatively, an "isolated peptide" or an "isolated polypq>tide** and the like, as used 
herein, refer to in vitro isolation and/or purification of a peptide or polypeptide 
molecule from its natural cellular environment, and from association with other 
components of the cell, i.e., it is not associated with in vivo substances. 

15 [0096] By ^'marker gene''* is meant a gene that imparts a distinct phenotype to cells 
expressing the marker ^ne and thus allows such transformed cells to he distmguished 
from cells that do not have the marker. A selectable marker gene confers a trait for 
which one can 'select' based on resistance to a selective agent {e,g.y a herbicide, 
antibiotic, radiation, heat, or other treatment damaging to untransformed cells). A 

20 screenable marker gene (or reporter gene) confers a trait tiiiat one can identify through 
observation or testing, le., by 'screwing* ie.g, p-glucuronidase, luciferase, or oflier 
enzyme activity not present m untransformed cells). 

100971 As used herein, a '*naturalfy^cciinin^' nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that occurs in nature. For 
25 example a naturaily-occurring nucleic acid molecule can encode a natural protem. 

[00981 By ^*obtainedfroni^ is meant that a sample such as, for example, a nucleic 
acid extract or polypeptide extract is isolated from, or derived from, a particular source. 
For example, the extract may be isolated directiy from any memhrane-translocating 
sequence-contaming organism, such as but not limited to hacteria, yeast and plants as 
30 weQ as animals including mammals, birds, reptiles, fish and insects. 

[0099] The term ^ oligonucleotide^ as used hereto ref^ to a polymer con^osed of 
a multiplicity of nucleotide residues (deoxyribonucleotides or ribonucleotides, or related 
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Structural variants or synthetic analogues thereoj^ including nucleotides with modified 
or substituted sugar groups and the like) linked via phosphodiest^ bonds (or related 
structural variants or synthetic analogues thereof). Thus, while the term 
"oligonucleotide" typically refers to a nucleotide polymer in which the nucleotide 
5 residues and linkages between them are naturally-occurring, it will be understood that 
the term also includes within its scope various analogue including, but not restricted to, 
p^tide nucleic acids (PNAs), phosphorothioate^ phosphoroditiiioate, 
phophoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoraniladate, 
phosphoroamidate, methyl phosphonates^ 2-0-methyl ribonucleic acids, and the like. 

10 The exact size of the molecule can vary dq>ending on the particular application. 
Oligonucleotides are a polynucleotide subset with 200 bases or fewer in length* 
Preferably, oligonucleotides are 10 to 60 bases in Iragth and most preferably 12, 13, 14, 
15, 16, 17, 18, 19, or 20 to 40 bases in length. Oligonucleotides are usually smgle 
stranded, e-g., for probes; although oligonucleotides may be double stranded, e.g., for 

IS use in the construction of a variant nucleic acid sequence. Oligonucleotides of the 
invention can be either sense or antisoase oligonucleotides. 

[00100] The term ^^operably connected^ or *^operably linkeif^ as used herein means 
placing a structural gene under the regulatory control of a promoter, which then controls 
the transcription and optionally translation of the gene* In the construction of 

20 heterologous promoter/structural gene combinations, it is generally preferred to position 
the genetic sequence or promoter at a distance fix)m the gene transcr^>tion start site that 
is approximately the same as the distance between that genetic sequence or promoter 
and the gene it controls in its natural setting; i.e. the gene fiom which the genetic 
sequence or {Kromoter is derived. As is known in the art, some variation in fliis distance 

25 can be accommodated without loss of function. Similarly, the preferred positioning of a 
regulatory sequence element with respect to a heterologous gene to be placed under its 
control is defined by the positioning of the client in its natural setting; Le., the genes 
fix>m which it is derived. 

[0100] The term ^pathogen'* is used heceia in its broadest sense to refer to an 
30 organism or an infectious a^nt whose infection of cells of viable plant tissue elicits a 
disease response. 

[0101] The tenn **pofynucIeoiide^' or **nucleic acid* as used h^ein designates 
mRNA, RNA, cRNA, cDNA or DNA. The term typically refers polymeric form of 
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nucleotides of at least 10 bases in length, ei&er ribonucleotides or deoxynucleotides or 
a modified form of ei&er type of nucleotide. The term includes single and double 
stranded forms of DNA. 

[0102] The tenns "polynucleotide variant and ^'vartanf* refer to polynucleotides 
S displaying substantial sequence identity with a reference polynucleotide sequence or 
polynucleotides that hybridise with a reference sequence under stringent conditions that 
are defined hereinafter. These terms also encompass polynucleotides in which one or 
more nucleotides have been added or deleted, or i^laced witii dififerent nucleotides. In 
this regard, it is well tmderstood in the art tixat certain alterations inclusive of mutations, 
10 additions, deletions and substitutions can be made to a reference polynucleotide 
whereby the altered polynucleotide retains a biological function or activity of the 
reference polynucleotide. The terms '^polynucleotide variant and *\arianf^ also include 
naturally-occurring allelic variants. 

[0103] ''Polypq)tide^\ peptide"' and '^protein" are used interchangeably herein to 
15 refer to a polymer of amino acid residues and to variants and synthetic analogues of the 
same. Thus, these terms apply to amino acid polymers in which one or more amino acid 
residues is a synthetic non-^naturally-occuiring amino acid, such as a chemical analogue 
of a corresponding naturally-occurring amino acid, as well as to naturally-occurring 
amino acid polymers. 

20 [0104] The term '^polypeptide variant refers to polypeptides which are 
distinguished from a reference polypeptide by the addition, deletion or substitution of at 
least one amino acid residue. In certain embodiments, one or more amino acid residues 
of a reference polypeptide are replaced by different amino acids. It is well understood in 
the art that some amino acids may be changed to others with broadly similar properties 

25 without changing the nature of the activity of the polypeptide (conservative 
substitutions) as described hereinafter. 

[01051 By ''primer"^ is meant an oligonucleotide which, when paired with a strand 
of DNA, is capable of initiating flie synthesis of a primer extension product in ttie 
presence of a suitable polymerising agent. The primer is preferably single-stranded for 
30 maximum efficiency in amplification but can alternatively be double-stranded. A primer 
must be sufficientiy long to prime the synthesis of extension products in the presence of 
the polymerisation agent. The lengtti of the primer depends on many factors, includmg 
application, temp^ature to be employed, template reaction conditions, oth^ reagents, 
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and source of primers. For example, dqiending on ttie complexity of the target 
sequence, the oKgonucleotide pnmer typically contains 15 to 35 or more nucleotide 
residues, although it can contdn fewer nucleotide residues. Primers can be large 
polynucleotides, such as &om about 35 nucleotides to several kilobases or more. 

5 Primers can be selected to be "substantially complementary^' to flie sequence on the 
template to which it is designed to hybridise and serve as a site for the mitiation of 
synthesis. By "substantially complementary*', it is meant that the primer is suflSciently 
complementary to hybridise with a tai^et polynucleotide. Preferably, the primer 
contams no mismatches with the template to which it is designed to hybridise but this is 

10 not essential. For example, non-complementary nucleotide residues can be attached to 
the 5' end of the primer, with the remainder of the prnner sequence being 
complementary to the tenq>late. Alternatively, non-complementary nucleotide residues 
or a stretch of non-complementary nucleotide residues can be interspersed into a primer, 
provided ftat the primer sequence has sufficient complementarity with the sequOTce of 

15 the template to hybridise tfierewifli and ther^y form a template for synthesis of the 
extension product of ttie primer. 

[0106] "Profte" refers to a molecule that binds to a specific sequence or sub- 
sequence or other moiety of another molecule. Unless otherwise indicated, the term 
'*probe'* typically refers to a polynucleotide probe that binds to another polynucleotide, 
20 often called flie "target polynucleotide**, through complementary base pairing. Probes 
can bind target polynucleotides lacking complete sequence complementarity witii the 
probe, depending on the stringency of the hybridisation conditions. Probes can be 
labelled directly or indirectly. 

[0107] The term ^'recombinant pofynudeotide'' as used herein refers to a 
25 polynucleotide formed in vitro by the manipulation of nucleic acid into a form not 
normally found in nature. For example, the recombinant polynucleotide may be in the 
form of an expression vector. Gently, such expression vectors include transcriptional 
and translational regulatory nucleic acid operably linked to the nucleotide sequence. 

[0108] By 'Recombinant polypeptide" is meant a polypeptide made using 
30 recombinant techniques, i.e., through the expression of a recombinant or synthetic 
polynucteotide. 

[0109] By ''regulatory element'' or "regulatory element is meant nucleic acid 
sequences (e.g,, DNA) necessary for expression of an operably linked coding sequence 
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in a particular host cell. The regulatory sequences that are suitable for piokaryotic cells 
for example, include a promoter, and optionally a cto-acting sequence such as an 
operator sequence and a ribosome binding site. Control sequences that are suitable for 
eukaryotic cells include promoters, polyadenylation signals, transcriptional enhancers, 
S tianslational enhancers, leader or trailing sequences that modulate mRNA stability, as 
well as targeting sequences that target a product encoded by a transcribed 
polynucleotide to an intracellular compartment within a cell or to the extracellular 
environment. 

lOllO] The term **sequence identuy" as used harem refers to flie extent that 
10 sequences are identical on a nucleotide-by-nucleotide basis or an amino acid-by-«mino 
acid basis over a window of comparison. Thus, a ^'percentage of sequence identit/* is 
calculated by comparing two optimally aligned sequences over the window of 
conqiarison, detemiining the number of positions at which the identical nucleic acid 
base (e.g.. A, T, C, G, J) or the id«itical amino acid residue (e.g., Ala, Pro, Ser, Ttar, 
15 Gly, Val, Leu, He, Phe, Tyr, Tip, Lys, Arg, ffis, Asp, Glu, Asn, Gin. Cys and Met) 
occurs in bofli sequences to yield the number of matched positions, dividing the number 
of matched positions by the total number of positions in the window of comparison (Le., 
the window size), and multiplying the result by 100 to yield tiie percentage of sequence 
identity. For the purposes of flie present invention, *^sequence identity* will be 
20 understood to mean the "match percentage*' calculated by the DNASIS computer 
program (Version 2.5 for windows; available fiom Hitachi Software engineering Co., 
Ltd., Sottfli San Francisco, California, USA) usmg standard defaults as used in the 
reference manual accompanying flie software. 

[0111] ^'Similarity"" refers to the percentage number of amino acids that are 
25 identical or constitute conservative substitutions as defined in Table B infra. Similarity 
maybe determined using sequence comparison programs such as GAP (Deveraux et al, 
1984, Nucleic Acids Research 12, 387-395). In this way, sequences of a similar or 
substantially different length to tiiose cited herein might be compared by insertion of 
gs^ into the aligmnent, such gaps bemg determined, for ^cample, by the comparison 
30 algorithm used by GAP. 

[0112] Terms used to describe sequence relationships between two or more 
polynucleotides or polypeptides include **referOTce sequence", "comparison wmdow", 
"sequence identity", '^percentage of sequence identity^ and "substantial idCTtrity*'. A 
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^>eference sequence^ is at least 12 but frequently 15 to 18 and often at least 25 
monomer units, inclusive of nucleotides and amino acid residues, in length. Because 
two polynucleotides may each comprise (1) a sequence (j.e.^ only a portion of flie 
complex polynucleotide sequence) that is similar between the two polynucleotides, and 

5 (2) a sequence that is diverg^t between the two polynucleotides, sequence con^arisons 
between two (or more) polynucleotides are typically performed by con^iaring 
sequences of the two polynucleotides over a ^comparison window" to identify and 
conyjare local regions of sequence similarity. A ^^comparison windoW^ refeis to a 
concq>tual segment of at least 6 contiguous positions, usually about 50 to about 100, 

10 more usually about 100 to about 150 in which a sequence is compared to a referaice 
sequence of tiie same number of contiguous positions after tiie two sequences are 
optimally aligned. The comparison window may comprise additions or deletions {i.e., 
gaps) of about 20% or less as compared to tiie reference sequence (which does not 
comprise additions or deletions) for optimal alignment of the two sequences. Optimal 

15 alignmrat of sequences for aligning a comparison window may be conducted by 
computerised in^lementations of algorithms (GAP, BESTFTT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 
575 Science Drive Madison, WI, USA) or by inspection and the best alignment (i.e., 
resulting in the highest percentage homology over the comparison window) generated 

20 by any of the various methods selected. Reference also may be made to the BLAST 
family of programs as for example disclosed by Altschul et aL, 1997, Nucl. Acids Res. 
25:3389. A detailed discussion of sequence analysis can be found in Unit 19.3 of 
Ausubel et al, "Current Protocols in Molecular Biology", John Wiley & Sons Inc, 
1994-1998, Chapter 15* 

25 [01131 By the term "taxon" herein is meant a unit of botanical classificatiorL It 
thus includes, genus, species, cultivars, varieties, variants and other minor taxonomic 
groxips which lack a consistent nomenclature. 

[01141 The term "transformation"^ means alteration of the genotype of an organism, 
for example a bacterium, yeast or plant, by the introduction of a foreign or endo^ous 
30 nucleic acid. 

[0115] By "vector*' is meant a polynucleotide molecule, preferably a DNA 
molecule derived, for example, from a plasmid, bacteriophage, yeast or virus, into 
whidi a polynucleotide can be inserted or cloned. A vector preferably contains one or 
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more unique restriction sites and can be capable of autonomous replication in a defined 
host cell including a target cell or ti^e or a progenitor cell or tissue thereof or be 
integrable with tiie genome of the defined host such that the cloned sequence is 
reproducible. Accordingly, the vector can be an autonomously r^licating vector, i.e., a 

5 vector that exists as an extrachromosomal ratity, the rq)Iication of which is independent 
of chromosomal replication, e.g., a linear or closed cnroular plasmid, an 
extrachiomosomal element^ a minichromosome, or an artificial chromosome. The 
vector can contain any means for assuring self-replication. Alternatively, the vector can 
be one which, when introduced into the host cell, is mtegrated into the genome and 

10 replicated together with the chrom6some(s) mto which it has been integrated. A vector 
system can comprise a single vector or plasmid, two or more vectors or plasmids, which 
togettier contain the total DNA to be introduced into the genome of the host cell, or a 
transposon. The choice of the vector will typically depend on the con^>atibility of the 
vector wifli the host cell into which the vector is to be introduced In the present case, 

15 the vector is preferably a viral or viral-derived vector, which is operably fimctional in 
animal and preferably mammalian cells. Such vector may be derived from a poxvirus, 
an adraovirus or yeast. The vector can also include a selection marker such as an 
antibiotic resistance gene that can be used for selection of suitable transformants. 
l^amples of such resistance genes are known to those of skill in the art. 

20 2, Modulation of disease resistance 

101161 The inveation is drawn to polynucleotide, polypeptide and methods for 
modulating disease resistance, especially for stimulating or enhancmg disease resistance 
in plants, caused by pathogens. Pathogens of the invention include, but are not limited 
to, viruses or viroids, bacteria, insects, nematodes, fimgi, and the like. Viruses include 

25 any plant virus, for example, tobacco or cucumber mosaic virus, ringspot virus, necrosis 
virus, maize dwarf mosaic virus, etc. Specific fimgal and viral pathogens for the major 
crops include: 

[01171 Soybeans: Phytophthora megasperma ftp. glycinea, Macrophomina 
phasBQliiuL, Rhizoctonia solani, Sclerotinia sclerotiorum, Fusarium oxysporum, 
30 Diaparihe phaseolorum var. sojae (Phomopsis sojae), Diaporthe phaseolorum var. 
cauHvora, Sclerotium rolfsiU Cercospora kikuchii, Cercospora sojina^ Peronospora 
manshurica, Colletotrichum dematium {Colletotichum tnmcatum), Corynespora 
cassiicola, Septoria glycines^ Phyllosticta sojicola, AUemaria altemata^ Pseudomonas 
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syringae p. v. glycinea, Xanthomonas campestris p.v. phaseolU Microsphaera diffusa^ 
Fusarium semitectum, Phialophora gregata^ Soybean mosaic vinxs, Glomerella 
glycines^ Tobacco Ring spot virus. Tobacco Streak virus, Phakopsorapachyrhizl, 
Pythium aphanidermatum^ Pythium ultimum^ Pythium debaryantim. Tomato spotted wilt 
virus, Heterodera glycines Fusarium solani; 

[0118] Canola: Albugo Candida^ Altemarta brassicae^ Leptosphaeria maculans^ 
Bhizoctonia solanU Sclerotinia sclerotiorum^ Mycosphaerella brassiccola^ Pythium 
ultimum, Peronospora parasitica, Fusarium roseum, Altemaria aliernaia; 

[0119] Alfalfa: Clavibater michiganese subsp. insidiosum, Pythium ultimum^ 
Pythium irregulare, Pythium splendenSy Pythium debaryanum, Pythium 
aphanidermatum^ Phytophthora megasperma^ Peronospora trifoliorum, Pkoma 
medicaginis var. medicaginis, Cercospora medicaginis, Pseudopeziza medicaginis, 
Leptotrochila medicaginis^ Fusariumj Xanthomonas campestris p.v. alfalfae, 
Aphanomyces euteiches, Stemphylium herbarum, Stemphylium alfalfae\ 

[0120] Wheat: Pseudomonas syringae p»v. atrofaciens, Urocystis . agropyri, 
Xanthomonas campestris p.v. translucens^ Pseudomonas syringae p.v, syringae, 
Altemaria altemata, Cladosporium herbarum, Fusarium graminearum, Fusariian 
avenaceum, Fusarium culmorum^ Ustilago tritici, Ascochyta tritici, Cephalosporium 
gramineum, Collotetrichum graminicola, Erysiphe graminis f.sp. tritici, Puccinia 
graminis f.sp. tritici^ Puccinia recondita £sp. tritici, Puccinia striiformis, Pyrenophora 
tritici-repentiSj Septoria nodorum, Septoria tritici, Sqjtoria avenae, 
Pseudocercosporella herpotrichoides, Rhizoctonia solani, Rliizoctonia cerealis, 
Gaeumannomyces graminis var. tritici, Pythium aphanidermatum, Pythium 
arrhenomanesy Pythium ultimum, Bipolaris sorofdniana. Barley Yellow Dwarf Virus, 
Brome Mosaic Virus, Soil Bome Wheat Mosaic Virus, Wheat Streak Mosaic Virus, 
Wheat Spindle Streak Virus, American Wheat Striate Virus, Claviceps purpurea, 
Tilletia tritici, Tilletia laevis, Ustilago tritici, Tilletia indica, Rhizoctonia solani, 
Pythium arrhenomannes, Pythium gramicola, Pythium aphanidermatum. High Plains 
Virus, European wheat striate virus; 

[0121] Sunflower Plasmophora halstedii, Sclerotinia sclerotiorum. Aster Yellows, 
Septoria helianthi, Phomopsis helianthi, Altemaria helianthi, Altemaria zinniae, 
Botrytis cinerea, Phoma macdonaldii, Macrophomina phaseolina, Erysiphe 
cichoracearum, Rhizopus oryzae, Rhizopus arrhizus, Rhizopus stolonifer, Puccinia 
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helianthU Vertidllium dahliae^ Erwinia carotovorum pv, carotovora, Cepttalosporium 
acremonhmtj Phytophthora cryptogea. Albugo tragopogonis; 

[01 22J Com: Fusarium moniliforme var. subglutmans^ Erwinia stewariii, Fusarium 
monihforme^ Gibberella zeae {Fusarium graminearum), Stenocarpella maydi (Diplodia 

5 maydis\ Pythium irregulare, Pythium debaryanum^ Pythium graminicola^ Pytkium 
splendens, Pythium ultimum, Pythium aphanidermatum^ Aspergillus Jlavus^ Bipolaris 
maydis O, T (Cochliobolus heterostrophus), Helminthosporium carbonum I, II & III 
{Cochliobolus carbonum\ Exserohilum turcicum I, II & m, Helminthosporium 
pedicellatum, Pkysoderma mcydis, Phyllosticta maydis^ Kabatiella-maydis^ Cercospora 

10 sorghi^ Ustilago maydis^ Puccinia sorghi, Pucdnia polysora^ Macrophomina 
phaseolina, Penicillium oxalicum^ Nigrospora oryzae^ Cladosporium herbarum, 
Curvularia lunata^ Curvularia inaequalis, Curvularia pallescens, Clavibacter 
michiganense subsp. nebraskense, Trichoderma viride. Maize Dwarf Mosaic Virus A & 
B, Wheat Streak Mosaic Virus, Maize Chlorotic Dwarf Virus, Claviceps sorghiy 

15 Pseudonomas avenae^ Erwinia chrysanthemi pv. zea, Erwinia carotovora^ Com stunt 
spiioplasma, Diplodia macrospora^ Sclerophthora macrospora^ Peronosclerospora 
sorghiy Peronosclerospora philippinensis, Peronosclerospora maydis^ 
Peronosclerospora saccharic Sphacelotheca reiliana^ Physopella zeae, Cephidosporium 
maydis, Cephalosporium acremonium. Maize Chlorotic Mottle Virus, High Plains 

20 Virus, Maize Mosaic Vuus, Maize Rayado Fino Virus, Maize Streak Virus, Maize 
Stripe Virus, Maize Rou^ Dwarf Virus; 

[0123] Sorghum: Exserohilum turcicum, Colletotrichum graminicola {Glomerella 
graminicola), Cerco^ora sorghi, Gloeocercospora sorghi, Ascochyta sorghina, 
Pseudomonas syringae p.v. syringae, Xanthomonas campestris p.v. hohdcola, 

25 Pseudomonas andropogonis, Puccinia purpurea, Macrophomina phaseolina, Perconia 
circinata, Fusarium moniliforme, Altemaria altemata, Bipolaris sorghicola, 
Helminthosporium sorghicola, Curvularia lunata, Phoma insidiosa, Pseudomonas 
avenae (Pseudomonas alboprecipitans), Ramulispora sorghi, Ramulispora sorghicola, 
PhyUachara sacchari, Sporisoriim reilianum (Sphacelotheca reiliana), Sphacelotheca 

30 omenta, Sporisorium sorghi. Sugarcane mosaic H, Maize Dwarf Mosaic Vims A & B, 
Claviceps sorghi, Rhizoctonia solani, Acremonium strictum, Sclerophthona 
Tnacrospora, Peronosclerospora sorghi, Peronosclerospora philippinensis, Sclerospora 
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grammicola, Fusarium gramineamm^ Fusarium axysporum^ Pythitm arrhenomanes^ 
Pythhm graminicola^ etc. 

[0124] Nematodes include parasitic nematodes such as root-knot^ cyst, lesion, and 
lenifomi nematodes, etc 

5 [0125] Insect pests include insects selected from the orders Coleoptera, Dipt^ 
Hymenoptera, Lepidoptera, Mallophaga, Homoptera, Hemiptera, Orthoptera, 
Thysanoptera, Deimaptera, Isoptera, Anoplura, Siphon^tera, Trichoptera, etc., 
particularly Coleoptera and Lepidoptera. Insect pests of the invention for the major 
crops include: 

10 [0126] Maize: Ostrinia nubilalis, European com borer, Agrotis ipsilon, black 
cutworm; Helicoverpa zea^ com earworm; Spodoptera frugiperda, fall armyworm; 
Diatraea grandiosella, southwestern com borer; Elasmopalpus lignosellus, lesser 
cornstalk borer; Diatraea saccharalis^ surgarcane borer; Diabrotica virgifera^ western 
com rootworm; Diabrotica longicomis barberi, northern com rootwomi; Diabrotica 

15 undecimpunctata howardi^ southern com rootworm; Melanotus spp., wireworms; 
Cyclocephala borealiSy northern masked chafer (white gmb); Cyclocephala 
immaculata, southern masked chafer (white gmb); Popillia japonica^ Japanese beetle; 
Chaetocnema pulicaria, com flea beetle; Sphenophorus maidis, maize billbug; 
Rhopalosiphum maidis, com leaf aphid; Anuraphis maidiradicis, com root aphid; 

20 Blissus leucopterus leucopterus^ chinch bug; Afelanoplusfemtprubrum, redlegged 
grasshopper; Melanoplus sanguinipes, migratory grasshopper; Hylemya platura, 
seedcom maggot; Agrontyza parvicomis, com blot leafininer; Anaphothrips obscrurus^ 
grass thrips; Solenopsis milesta, thief ant; Tetranychus urticae, twospotted spider mite; 

[0127] Sorghum: Chilo partellus, sorgjium borer; Spodopterafrugiperda^ fall 
25 armyworm; Helicoverpa zea, com earworm; Elasmopalpus lignosellus, lesser cornstalk 
borer; Feltia subterranea, granulate cutworm; Phyllophaga crinita, white grub; 
Eleodes^ Conoderus, and Aeolus spp., wireworms; Oulema melanopus, cereal leaf 
beetle; Chaetocnema pulicaria^ com flea beetle; Sphenophorus maidis, maize billbug; 
Rhopalosiphum maidis; com leaf aphid; Sipha flava, yellow sugarcane aphid; Blissus 
30 leucopterus leucopterus^ chinch bug; Contarinia sorghicola^ sorghima midge; 
Tetranychus cinnabarinus^ carmine spider mite; Tetranychus urticae^ twospotted spider 
mite; Wheat: Pseudaletia unipunctata, army worm; Spodoptera Jrugiperday fall 
armjrworm; Elasmopalpus lignosellus, lesser cornstalk borer; Agrotis orthogonia, 
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western cutworm; Elasmopalpus lignosellus, lesser cornstalk borer; Oulema melanopus, 
cereal leaf beetle; Hypera punctata^ clover leaf weevil; Diabrotica undecimpunctata 
howardi, southern com rootworm; Russian wheat aphid; Schizaphis graminum, 
greenbug; Macrosiphum avenae, English grain ^hid; Melanoplus femurrubrum^ 
5 redlegged grasshopper; Melanoplus differentialis, differential grasshopper; Melanoplus 
sanguinipes, migratory grasshoppen Mqyetiola destructor, Hessian fly; Sitodiplosis 
mosellana^ wheat midge; Meromyza ammicana, wheat stem maggot; Hylemya 
coarctata, wheat bulb fly; FranHiniella fusca, tobacco thrips; Cephus cinctus, wheat 
stem sawfl^ Aceria tulipae, wheat curl mite; Sunflower: Suleima helianthana, 

10 sunflower bud moth; Homoeosoma electellum, sunflower moth; zygogramma 
exclamationis, sunflower beetle; Bothyrus gibbosus, carrot beetle; Neolasioptera 
murtfeldtiana, sunflower seed midge; Cotton; Heliothis virescens, cotton budworm; 
Helicaverpa zea, cotton bollwonn; Spodoptera exigua, beet armywonn; Pectinophora 
gossypiella, pink bollworm; Anthonomus grandis grandis, boll weevil; Aphis gossypiiy 

15 cotton ^hid; Pseudatomoscelis seriatus, cotton fleahopper; Trialeurodes abutiloneaj 
bandedwinged whitefly; Lygus lineolariSy tarnished plant bug; Melanoplusfemurrubrum, 
redlegged grasshopper; Melanoplus differentialis, differential grasshoppen Hurips 
tabaci, onion thrips; Franldinfdella Jusca, tobacco thrips; Tetranychus cinnabarinus, 
carmine spider mite; Tetranychus urticae, twospotted spider mite; 

20 [0128] Rice: Diatraea saccharalis, sugarcane borer; Spodoptera frugiperda^ fall 
armywonn; Helicoyerpa zea, com earwonn; Colaspis brunnea, grape colaspis; 
Lissorhoptrus oryzopMlus, rice water weevil; Sitophilus oryzae, rice weevil; Nephotettix 
nigropictus, rice leafhopp^; Blissus leucopterus leucopteruSy chinch bug; Acrostemum 
hilare, green stink bugg 

25 [0129] Soybean: Pseudoplusia includens, soybean looper; Anticarsia gemmatalis, 
velvetbean caterpillar, Plathypena scabra^ green cloverworm; Ostrinia nubilalis, 
European com boren Agrotis ipsilon, black cutworm; Spodoptera exigua, beet 
armyworm; Heliothis virescens, cotton budworm; Helicaverpa zea, cotton bollworm; 
Epilachna varivestis, Mexican bean beetle; Myzus persicae, green peach aphid; 

30 Empoasca fabae, potato leafibopper; Acrostemum hilare, green stink bug; Melanoplus 
fsmmrubrwn, redlegged grasshopper; Melanoplus differentialis, differential 
grasshopper, Hylen^a platura, seedcom maggo^, Sericothrips variabilis, soybean 
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thrips; Thrips tabacU onion thrips; Tetranychus turkesianU strawbOTy spvSisi mite; 
Tetranychus urticae^ twospotted spider mite; 

[0130] Barley: Ostrinia nubilalis^ Euiopean com borer; Agrotis ipsilon^ black 
cutworm; Schizaphis graminuniy greenbug; Blissus leucoptems leucopterus, chindi bug; 
5 Acrosiemim hilare^ green stink bug; Euschistus servusy brown stink bug; Delia pUuura, 
seedcom maggot; Mayetiola destructor^ Hessian f\y\ Petrobia latens^ brown wheat 
mite; Oil Seed Rape: Brevicoryne brassicae^ cabbage aphid; Phyllotreta cmciferae. 
Flea beetle; Mamestra conftgurata. Bertha aimyworm; Plutella xylostella^ Diamond- 
back moth; Delia ssp.. Root m^gots. 

10 [0131] In certain embodiments, the plant pathogen is selected from fimgi, 
especially soil borne fungi such as Fusarium oxysporum, Verticillium dahliae, 
Cladosporium and Ralstona Solanaceum, 

3. Polynucleotides of the invention 

[0132] The present invention is predicated, in part, on the isolation of two novel R 
IS graes fix>m banana, one fiom Musa acuminata Calcutta 4 and the other fiom Musa 
acuminata spp malaccensis. The Calcutta 4 gene designated RGAS is 4380 nts long and 
comprises a smgle open reading frame of 4321 nts that encodes a 1441-aa putative 
polypeptide product The nucleotide sequence of this gene and its deduced polypeptide 
sequence are presented in SEQ ID NO: 1 and 2, respectively. The Musa acuminata spp 
20 malaccensis gene designated RGA2 comprises a single open reading frame of 3699 nts, 
which encodes a putative polypeptide product of 1232 aa. The nucleotide sequence of 
the RGA2 gene and its deduced polyp^tide sequence are presented in SEQ ID NO: 3 
and 4, respectively. 

[0133] In accordance with the present invention, the novel R genes will be usefiil 
25 for &cilitating the construction of crop plants that are resistant to pathogenic disease, 
especially disease caused by fungal pathogens, viruses, nematodes, insects and the like. 
The R genes of the present invration can also be used as markers in genetic mq)ping as 
well as in assessing disease resistance in a plant of interest. Thus, the sequences can be 
used in breeding programs. See, for example, Gentzbittel et al (1998, T%eor. AppL 
30 Genet 96:519-523). Additional xises for the sequences of the invention include using 
the sequences as bait to isolate other signalling components on defence/resistance 
pathwaj^ and to isolate the corresponding'promoter sequences. The sequences may also 
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be used to modulate plant development processes, such as pollm development, 
regulation of organ shape, differentiation of aleuione and shoot epidennis^ embryogenic 
competence, and cell/cell interactions. See, generally, Sambrook et al (1989) Molecular 
Cloning: A Laboratory Manual (2nd ed., Cold Spring Harbor Laboratory Press, 
5 Plainview, N.Y.)- The sequences of the present invention can also be used to generate 
variants (e.g., by domain swapping') for tibie generation of new resistance specificities, 

[0134] The invention encompasses isolated or substantially purified nucleic acid or 
protein compositions. An **isolated" or **purified" nucleic acid molecule or protein, or 
biologically active portion tihereof, is substantially or essmtially fi-ee from conqxinents 

10 that normally accompany or interact with the nucleic acid molecule or protein as found 
in its naturally occurruig environment* Thus, an isolated or purified polynucleotide or 
polypeptide is substantially free of other cellular material, or culture medium when 
produced by recombinant techniques, or substantially free of chemical precursors or 
other chemicals when chemically synthesised. Suitably, an •isolated" polynucleotide is 

IS free of sequences (especially protein encoding sequences) ttiat naturally flank Uie 
polynucleotide (i.e., sequences located at tiie 5 ' and 3' ends of the polynucleotide) in the 
genomic DNA of the oi;ganism from which flie polynucleotide was derived. For 
example, in various embodiments, the isolated polynucleotide can contain less than 
about 5 ld>, 4 kb, 3 kb, 2 kb, 1 kb, 0,5 kb. or 0.1 kb of nucleotide sequences that 

20 naturally flank the polynucleotide m genomic DNA of the cell fixjm which the • 
polynucleotide was derived. A polypeptide that is substantially free of cellular material 
mcludes preparations of protein havmg less than about 30%, 20%, 10%, 5%, (by dry 
weight) of contaminatmg protein. When the protein of the invention or biologically 
active portion thereof is recombinantly produced, culture medium suitably represents 

25 less than about 30%, 20%, 10%, or 5% (by dry weight) of chemical precursors or non- 
protein-of-interest chemicals. 

[0135] The present invention also encompasses portions of the disclosed nucleotide 
sequences. Portions of a nucleotide sequence may encode polypeptide portions or 
segments Oat retain the biological activity of the native polypeptide and hence 
30 modulate or regulate disease resistance. Alternatively, portions of a nucleotide sequence 
thai are usefril as hybridisation probes generally do not encode amino acid sequences 
retaining such biological activity. Thus, portions of a nucleotide sequence may range 
from at least about 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40. 50, 
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60, 80, 90, 100 nucleotides, or almost up to the fiiU-lenglh nucleotide sequence 
encoding the polypq;)tides of the invention. 

10136] A portion of an R nucleotide sequence that encodes a hiologically active 
portion of an R poljpeptide of the invention will encode at least about 5, 6, 7, 8, 9, 10, 

5 11, 12, 13. 14. 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25. 30. 40, 50, 60, 70. 80, 90, 100, 
120, 150, 300, 400, 500, 600, 700, 800, 900 or 1000 contiguous amino acid residues, or 
almost up to the total number of amino acids present in a fiiU-length R polypeptide of 
the invention (for example, 1440 or 1330 ammo acid residues for SEQ ID NO: 2 or 4, 
respectively). Portions of an R nucleotide sequence that are use&l as hybridisation 

10 probes or PGR primers generaUy need not encode a biologically active portion of an R 
polypeptide. 

(01371 Thus, a portion of an R nucleotide sequence may encode a biologically 
active portion of an R polypeptide, or it may be a fragment that can be used as a 
hybridisation probe or PGR primer using standard methods known in the art A 
15 biologically active portion of an R polypeptide can be prq)ared by isolating a portion of 
one of flie R nucleotide sequences of tiie mvention, expressing the encoded portion of 
the R polypeptide (e.g., by recombinant expression in vitro), and assessmg the activity 
of tiie encoded portion of the R polypeptide. Nucleic acid molecules that are portions of 
an R nucleotide sequence comprise at least about 15, 16, 17, 18, 19, 20, 25, 30, 50, 75, 
20 100, 150, 200, 250, 300, 350, 400, 450, 500. 550, 600, or 650 nucleotides, or ahnost vp 
to tiie number of nucleotides present in a full-length R nucleotide sequence disclosed 
herein (for example, 4375 or 3690 nucleotides for SEQ ID NO: 1 or 14, respectively). 
10138] The mvention also contemplates variants of the disclosed nucleotide 
sequences. Nucleic acid variants can be naturaUy occurrmg. such as allehc variants (same 
25 locus), homologues (different locus), and orthologues (different organism) or can be non 
naturally occurring NaturaUy occurring variants such as tiiese can be identified witii the 
use of well-known molecular biology techniques, as, for example, witii polymerase 
chain reaction (PGR) and hybridisation techniques as known in tiie art Non-naturally 
occurring variants can be made by mutagenesis techniques, mchiding tiiose appUed to 
30 polynucleotides, cells, or organisms. The variants can contain nucleotide substitutions, 
deletions, inversions and insertions. Variation can occur m eitiier or botti flie coding and 
non-coding regions. The variations can produce bofli conservative and non<onservative 
amino acid substitutions (as compared m flie encoded product). For nucleotide sequences, 
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conservative variants include those sequences that, because of the degeneracy of the 
genetic code, encode the amino acid sequence of one of the R polypeptides of the 
invention. Variant nucleotide sequences also include syntheticaUy derived nucleotide 
sequences, such as those generated, for exanq)!e, by using site-directed mutagenesis but 
5 whicb still encode an R polypeptide of the invention. Generally, variants of a particular 
nucleotide sequence of the invention will have at least about 30%, 40% 50%, 55%, 
60%, 65%, 70%, generaUy at least about 75%, 80%, 85%, desirably about 90% to 95% 
or more, and more suitably about 98% or more sequrace identity to that particular 
nucleotide sequence as detemiined by sequence alignment programs described 
10 elsewhere herein using default parameters. 

101391 The nucleotide sequences of tihe invention can be used to isolate 
corresponding sequences fixsm oflier organisms, particularly other plants. Methods are 
readily available in the art for the hybridisation of nucleic acid sequences. Coding 
sequences from other plants may be isolated according to well known techniques based 
15 on their sequence identity with the coding sequences set forth herdn. hi these 
techniques all or part of the known coding sequence is used as a probe which selectively 
hybridises to other R coding sequraces present in a population of cloned genomic DNA 
fragments or cDNA fragments (i*e., genomic or cDNA libraries) from a chosen 
organism. Accordingly, tiie present invention also contemplates polynucleotides that 
20 hybridise to the R graie nucleotide sequences, or to their complements, under stringOTcy 
conditions described below. As used herein, the term 'Tiybridises under low stringency, 
medium stringency, high stringwicy, or very high stringency conditions" describes 
conditions for hybridisation and washing. Guidance for performing hybridisation 
reactions can be found in Ausubel et al, (1998, supra). Sections 6.3.1-6.3.6- Aqueous 
25 and non-aqueous mefliods are desmbed in that refwnoe and eitiiCT can be used. 
Reference herein to low stringency conditions include and encompass fix>m at least 
about 1% \N to at least about 15% v/v formamide and from at least about 1 M to at 
least about 2 M salt for hybridisation at 42** C, and at least about 1 M to at least about 2 
M salt for washing at 42° C, Low stringency conditions also may include 1% Bovine 
. 30 Serum Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for 
hybridisation at 65« C, and (i) 2xSSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 
mM NaHP04 (pH 7.2), 5% SDS for washing at room temperature. One embodiment of 
low stringency conditions mcludes hybridisation in 6X sodium chloride/sodium citrate 
(SSC) at about 45*' C, followed by two washes in 0.2X SSC, 0.1% SDS at least at 50* C 
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(file temperature of flie washes can be increased to 55* C for low stringency conditions). 
Medium stringency conditions include and aioompass from at least about 16% v/v to at 
least about 30% v/v formamide and fiom at least about 0,5 M to at least about 0.9 M 
salt for hybridisation at 42<» C, and at l^t about 0. 1 M to at least about 0.2 M salt for 
washmg at 55** C. Medium stringency conditions also may include 1% Bovine Serum 
Albumin (BSA), 1 mM EDTA, 0.5 M NaHP04 (pH 7.2), 7% SDS for hybridisation at 
65** C, and (i) 2 x SSC, 0.1% SDS; or (ii) 0.5% BSA, 1 mM EDTA, 40 mM NaHP04 
(pH 7.2), 5% SDS for washing at 60-65° C. One embodiment of medium stringency 
conditions includes hybridising in 6X SSC at about 45° C, followed by one or more 
washes in 0.2X SSC, 0.1% SDS at 60° C. High stringency conditions include and 
encompass ftom at least about 31% v/v to at least about 50% v/v formamide and fiom 
about 0.01 M to about 0.15 M salt for hybridisation at 42° C, and about 0.01 M to about 
0.02 M salt for washing at 55° C High stringency conditions also may include 1% 
BSA, 1 mM EDTA, 0.5 M NaHPO* (pH 7.2), 7% SDS for hybridisation at 65° C, and 
(i) 0.2 X SSC, 0.1% SDS; or (u) 0,5% BSA, ImM EDTA, 40 mM NaHP04 (pH 7.2), 
1% SDS for washing at a temperature m excess of 65° C. One embodiment of high 
stringency conditions inchides hybridismg in 6X SSC at about 45** C, followed by one 
or more washes m 0.2X SSC, 0,1% SDS at 65° C. 

10140] In certam embodiments, an isolated nucleic acid molecule of the invention 
hybridises under very high stringency conditions. One embodiment of very hi^ 
stringency conditions includes hybridising 0.5M sodium phosphate, 7% SDS at 65°C, 
followed by one or more washes at 0.2X SSC, 1% SDS at 65° C. 

[0141] Other stringency conditions are well known in the art and a skilled 
addressee will recognise tiiat various factors can be manipulated to optimise the 
specificity of the hybridisation. Optimisation of the stringency of the final washes can 
serve to ensure a high degree of hybridisation. For detailed examples, see Ausubel et 
ai, supra at pages 2.10.1 to 2.10.16 and Sambrook et al (1989, supra) at sections 1,101 
to 1.104, 

[0142] While stringent washes are typically earned out at temperatures fix>m about 
42° C to 68° C, one skilled in the art will appreciate that other temperatures may be 
suitable for stringent conditions. Maximtmi hybridisation rate typically occurs at about 
20° C to 25° C below the for fomiation of a DNA-DNA hybrid. It is weU known in 
the art fliat the Tm is the melting temperature, or traiperature at which two 
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complemeataiy polynucleotide sequences dissociate. Methods for estimating Tm are 
well known in fhe art (see Ausubel et al^ supra at page 2.10.8). In general, the Tm of a 
p^ecUy matched duplex of DNA may be predicted as an s^pioximation by the 
formula: 

5 Tm« 81.5 + 16.6 (logio M) + 0.41 (%G+C) - 0.63 (% formamide) - (600/length) 

wherein: M is the concentration of Na*, preferably in the range of 0.01 molar to 0.4 
molan %G+C is the sum of guanosine and cytosine bases as a percentage of the total 
nimiber of bases, within the range between 30% and 75% G+C; % formamide is the 
percent formamide concentration by volimie; length is the number of base pairs in the 
10 DNA duplex. The Tm of a duplex DNA decreases by approximately 1® C with every 
increase of 1% in the number of randomly mismatched base pairs. Washing is generally 
carried out at Tn> - 15** C for high stringency, or Tm 30^ C for moderate stringency. 

[0143] In one example of a hybridisation procedure, a membrane (e.g., a 
nitrocellulose membrane or a nylon membrane) containing immobilised DNA is 

15 hybridised overnight at 42° C in a hybridisation buffer (50% deionised formamide, 
SxSSC, 5x Denhardt's solution (0.1% ficoll, 0.1% polyvinylpyroUidone and 0.1% 
bovine serum albumin), 0.1% SDS and 200 mg/mL denatured salmon sperm DNA) 
containing labelled probe. The membrane is then subjected to two sequential medium 
stringency washes (i.e., 2xSSC, 0.1% SDS for 15 min at 45^ C, followed by 2xSSC, 

20 0.1% SDS for 15 min at 50® C), followed by two sequMitial higher stringency washes 
(i.e., 0.2xSSC, 0.1% SDS for 12 min at 55® C followed by 0.2xSSC and 0,1%SDS 
solution for 12 min at 65-68® C. 

[0144] Variant nucleotide sequences also encompass sequences derived from a 
mutag^c or recombinogenic procedures such as *DNA shuffling' which can be used 

25 for swapping domains in a polyp^tide of interest with domains of other polypeptides. 
With DNA shuffling, one or more different R coding sequences can be manipulated to 
create a new R sequence poss^sing desired properties. In this procedure, libraries of 
recombinant polynucleotides are generated from a population of related polynucleotides 
comprising sequence regions diat have substantial sequence identity and can be 

30 homologous ly recombined in vitro or in vivo. For example, using this approach, 
sequence motifs encoding a domain of interest (e.g., the coiled coil domain, the NBS 
domain and/or the LRR domain of the polypeptides of the invention) may be shuffled 
between the R gene of the invention and other known R genes to obtain a new gme 
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coding for a protein with an improved property of interest, such broadening spectrum of 
disease resistance. Illustrative resistance R genes that could be ^ployed for tiiis 
purpose are listed in Table B below* 

TABLE B 

Plant disease resistance genes cloned prom 1994 to 2003, 



PLANT-PATHOGEN [NTERACfEON 


PLANT (R) 
PROTEIN 
STRUCTUBE 


NAME 


REFERENCE 


Flax 


Afelwnpsora lini 


TIR-NBS-LRR 


L 


Lawrence ei of., 1995 


TOBACCO 


Tobacco mosaic vims 


TIR-NBS-LRR 


N 


Whitman ef a/., 1996 


Flax 


MeUanpsoro IM 


TIR-NBS-LRR 


M 


Anderson ei al» 1997 




P&ronospora parasitica 


TIR-NBS-LRR 


RPP5 


Paikerero/.. 1997 


Arabklopsis 


Fseudomonas 


TIR-NB5-LRR 


RPS4 


Gassmami el«£. 1999 


Fbx 


Meiampsoni ttni 


TIR-NBS-LRR 


P 


Doddse/d:.2000 


3.U A 
rbidopsi 


PseiitdootOKOs syrtngai, 


NBS-LRR 


RPS2 


MindrinoseltfiL. 1994 


Tomata 




NBS-LRR 


Prf 


Salmaoo 1996 


Arbidopsis 


Pseudomonas syringae 


NBS-LRR 


RPMt 


Qvmxetalu 1996 


Wheat 


Heterodera avenae 


NBS-LRR 


Crt3 


Lagudah et al,, 1997 


Tomato 


A? E 

usarium 


NBS-LRR 


12 


Simons e/ oil, 1998 


Tomato 


3.3 M 
eioidpgynepp 


NBS-LRR 


Mi 


MilfisBDeroJ^ 1998 


Tcnnato 


Macrosij^m 
eigihorhie 


NBS-LRR 


Mi 


Mimgane/o/.. 1998 


Aiabidopsis 


Peronospora parasitica 


NBS-LRR 


RPPl 


Botdlae/aJL. 1998 


Lettuce 


Bremia iMfucee 


NBS-LRR 


Dm3 


Meyers ernil, 1998 


Rice 


Xarxthomonas 


NBS-LRR 


Xi\ 


YoshimuRiel/iil. 1998 


Aiabidopsis 


Pseudomonas 


NBS-LRR 


RPS5 


Wairene/olL. 1998 


Maize 


PuccOtia sofghi 


NBS-LRR 


Rpl-D 


Conins era/., 1999 


Pqiper 


Xanthomonas 
campgsffis 


NBS-LRR 


Bs2 


Thai «fdL, 1999 


Potato 


PVX 


NBS-LRR 


Rx2 


Bendahmane etal, 1999 


Rice 


Magntgtorthc 


NBS-LRR 


Pi-te 


BfyanefuJl, 2000 


Baiiey 


Biumeria gmmints 


NBS-LRR 


Mia 


Zhou et aU 2000 


Aralrfdopsis 


Peronospora parasitica 


NBS-LRR 


RPPIS 


Bittner-Eddy et al., 2000 


Tomato 


Tospovmis 


NBS-LRR 
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PLANT-PATHOGEN INTERACTION 


PLANT (R) 
PROTEIN 
STRUCTURE 


R PROTEIN 

N AMP* 


REFERENCE 


Potato 


Globodera pallida 


NBS-LRR 


Gpa2 


Van der Vossen et al., 2000 


Potato 


Pl^ophtora infesUms 


NBS-LRR 


Rl 


BaUvoTa et at, 2002 


Tomato 


Globodera 
rostochiensis 


NBS-LRR 


Hero 


EnisteralL,2002 


Potato 


Pkytophthora bffestans 


NBS-LRR 


RB 


Song eta!^ 2003 



[01451 Strategies for DNA shuffling are known in the art. See, for example: 
Stemmer (1994, Proc. Natl Acad. Sci. USA 91:10747-10751; 1994, Nature 370:389- 
• 391); Crameri et al (1997, Nature Biotech. 15:436-438); Moore et al (1997, J. Mol 
5 Biol 272:336-347); Zhang et al (1997 Proc. Nail Acad. ScL USA 94:4504-4509); 
Crameri et al (1998, Nature 391:288-291); and U.S. Pat Nos. 5,605,793 and 5,837,458. 

4. Polypeptides of the invention 

[0146] The present invention provides polypeptides and biologically active portions 
thereof that confer resistance to disease, especially resistance to pathogenic disease 

10 including disease caused by fungal pathogen, viruses, nematodes, insects and the like. 
Biologically active portions of the R polypeptides of the invention include portions with 
inununo-mtoractive activity of at least about 6, 8, 10, 12, 14, 16» 18, 20, 25, 30, 40, 50, 
60 amino acid residues in length. For example, immuno-interactive fiagments 
conten^>lated by the present invention are at least 6 and desirably at least 8 amino acid 

15 residues in length, which can elicit an immune response in an animal for the production 
of antigen*binding molecules that are immuno-interactive witti the R polypeptides of the 
invention. Such antigen-binding molecules <^ be used to screen organisms, especially 
plants, for structurally and/or fimctionally related R polypeptides. Typically, portions of 
the disclosed R polypeptides may participate in an interaction, e.g., an intramolecular or 

20 an inter-molecular interaction. An inter-molecular interaction can be a specific bmding 
interaction or an enzymatic interaction (e.g., the interaction can be transient and a 
covalent bond is formed or broken). An inter-molecular interaction can be between an R 
prfypeptide and a pathogen elicitor protein. Biologically active portions of an R 
polypeptide include peptides conq>rising amino add sequences sufficiently similar to or 

25 derived flom tiie amino acid sequences of the disclosed R polypeptides, e.g., the amino 
acid sequraces shown in SEQ ID NO: 2 or 4, which include less amino acids than the 
fiill-length R polypq>tide, and exhibit at least one activity of an R polypeptide. 
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Typically, biologically active portions coiiq)ris6 a domain or motif wift at least one 
activity of Uie R polyp^tide, e.g., the ability to bind to a pa&ogoi elidtor protein or to 
confCT disease resistance. A biologically active portion of an R polypq)tide can be a 
polypeptide which is, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20. 
5 21, 22, 23. 24, 25, 30, 40, 50, 60, 70, 80, 90, 100, 120, 150, 300, 400, 500. 600, 700, 
800, 900 or 1000 or more amino acids in lengtiu Suitably, the portion is a '"biologically- 
active portion" having no less than abo>it 1%, 10%, 25% 50% of the pathogen eUcitor 
protdn-binding activity or the resistanoe-cwiferring activity of tiie fiill-lengfli 
polypeptide. 

10 [0147] The present invention also contemplates variant R polypqitides. "Variant" 
polyp^tides include proteins delved firom the native pvotdn by deletion (so-called 
truncation) or addition of one or mote amino acids to the N-terminal and/or C-traminal 
end of the native protein; deletion or addition of one or more amino adds at one or more 
sites in tiie native protein; or substitution of one or more amino acids at one or more 

15 sites in the native protem. Variant proteins encwnpassed by ttie present invention are 
biologically active, that is tiiey continue to possess the desired Iriological activity of the 
native protein, tiiat is, modulating disease resistance or interacting witii a pa&ogai 
elicitor protem. Such variants may result from, for example, genetic polymorphism or 
ftom human manipulation. Biologically active variants of a native R protein of Ihe 

20 invention wiU have at least 40%, 50%, 60%, 70%, genestally at least 75%, 80%, 85%, 
preferably about 90% to 95% or mote, and more prefeably about 98% or more 
sequoice sfanilaiity with tiie ammo acid sequence for the native protein as detomined 
by sequence alignment programs described elsewhere herdn uang default parameters. 
A biolo^cally active variant of a protein of the invention may differ from that protein 

25 by as few as 1-15 amino add residues, as few as 1-10, sudi as 6-10, as few as 5, as few 
as 4, 3, 2, or even 1 amino acid residue. 

[0148] The pro^ns of tto mvention may be alteted in various ways mduding 
amino add substitutions, deletions, ttuncatioos, and insertions. Metiiods for such 
manipulations are generally known in the art. For example, amino acid sequence 
30 variants of the R proteins can be prepared by mutations in the DNA. Mefliods for 
mutagenesis and nucleotide sequence alterations are well known in tiie art. See, for 
exan^le, Eunkd (1985, Proc. Natl. Acad. ScL USA 82:488-492), Kunkd et at. (1987, 
Methods in Enzymol 154:367-382), U.S. Pat No. 4,873,192. Watson. J. D. et al. 
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("Molecular Biology of the Gene", Fourth Edition, Bedjamin/Cummmgs, Menlo Park, 
Calif., 1987) and the references rated therein. Guidance as to appropriate amino acid 
substitutions that do not affect biological activity of the protein of interest may be found 
in the model of Dayhoff et al. (1978) Atlas of Protein Sequence and Structure (NatL 
5 Biomed. Res. Found.. Washington. D.C.). Methods for screening gene products of 
combinatorial libraries made by pomt mutations or truncation, and for screening cDNA 
Ubraries for gene products havmg a selected property are known in the art Such 
methods are adaptable for rapid screetung of the gene libraries generated by 
combinatorial mutagenesis of R polypeptides. Recursive ensemble mutagenesis (REM), 
10 a new technique which enhances the fiequency of fanctional mutants in the Ubraries. 
can be used m combination with the screening assays to identify R polypeptide variants 
(Aridn and Yourvan (1992) Proc NatL Acad. Set USA «9:78n-7815; Delgrave et al 
(1993) Protein Engineering 6:327-331). Conservative substitutions, such as exchanging 
one amino acid with another having smnlar properties, may be desirable as discussed in 
1 S more detail below. 

[0149] Variant polypeptides may contain conservative amino acid substitutions at 
various locations along their sequence, as compared to the R polypeptide amino acid 
sequences of the invention. A "conservative amino acid substitution" is one in which 
the amino acid residue is replaced witii an amino acid residue having a similar side 
20 chain. FamUies of ammo acid residues having shmlar side chains have been defined in 
the art, which can be generally sub-classified as foUovro: 

(01501 Acidic: The residue has a negative charge due to loss of H ion at 
physiological pH and the residue is attracted by aqueous solution so as to seek the 
surface positions in the conformation of a peptide in which it is contained when the 
25 peptide is in aqueous medium at physiological pH. Ammo acids having an acidic side 
chain include glutamic acid and aspartic acid. 

lOlSll Basic: The residue has a positive charge due to association with H ion at 
physiological pH or withm one or two pH units thereof (e.g.. histidine) and the residue 
is attracted by aqueous solution so as to seek the surface positions m the conformation 
30 of a p^tide in which it is contained when the peptide is m aqueous medium at 
physiological pH. Amino acids having a basic side chain include arginine, lysine and 
histidine. 
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[0152] Charged: The residues are charged at physiological pH and, therefore, 
include amino acids having acidic or basic side diains (i.e.. glutamic acid, aspartic acid, 
arginine, ly^e and histidine). 

[0153] Hydrophobic: The residues are not charged at physiological pH and the 
residue is rq>elled by aqueous solution so as to seek the inner positions in the 
conformation of a pep&de in which it is contained when the peptide is in aqueous 
medium. Amino acids having a hydrophobic side chain include tyrosine, vahne, 
isoleucine, leucine, methionine phmylalanine and tryptophan. 

[01541 Neutral/polan The residues are not charged at physiological pH, but the 
residue is not sufficiently repeUed by aqueous solutions so that it would seek inner 
positions in the conformation of a peptide in which it is contained when the peptide is in 
aqueous medium. Amino acids having a neutral/polar side chain include asparagine, 
glutamme, cysteine, histidine, serine and threonine. 

[OlSSI This description also characterises certain amino acids as "small" since thdr 
15 side chains are not sufBciently large, even if polar groups are lacking, to confer 
hydrophobicity. With the exception of proline, "small" amino acids are those with four 
carbons or less when at least one polar group is on the side chain and three carbons or 
less when not. Amino acids having a small side chain include glycine, serine, alanine 
and threonine. The gene-encoded secondary amino acid proline is a special case due to 
20 its known effects on the secondary conformation of peptide chains. The structure of 
proline differs from aU the other naturaUy-occurring amino acids in that its side chain is 
bonded to the nittogen of the oi-amino group, as weU as the owsarbon. Several amino 
acid similarity matrices (e.g., PAM120 matrix and PAM250 matrix as disclosed for 
example by Dayhoflf et al. (1978) A model of evolutionary change in proteins. Matrices 
for determining distance relationships In M. O. Dayhofif, (ed.). Adas of protein 
sequence and structure. Vol. 5. pp. 345-358, National Biomedical Research Foundation, 
Washmgton DC; and by Gonnet et al., 1992, Science 256(5062): 144301445), however, 
include proline in the same group as glycine, serine, alanine and threonine. 
Accordingly, for the purposes of the present invention, proline is classified as a "smaU" 
30 amino acid. 

[0156] The degree of attraction or repulsion required for classification as polar or 
nonpolar is arbitrary and, therefore, amino acids specifically contemplated by the 



25 
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invention have been classified as one or the other Most amino acids not specifically 
named can be classified on the basis of known behaviour. " 

(01571 Amino acid residues can be further sub-classified as cyclic or noncyclic, and 
aromatic or nonaiomatic. self-explanatory classifications wift reject to fixe dde^ihain 
5 substituent groups of the residues, and as small or laige. The lesidue is considered smaU 
if it contains a total of fi,ur carbon atoms or less, inclusive of the caiboxyl carbon, 
provided an additional polar substituent is present; fiu^e or less if not. SmaU residues' 
are, of course, always nonaromatic. Dependent on their atractural properties, amino acid 
residues may fall in two or more classes. For the naturally-occuning protem amino 
10 acids, sub-classification according to the this scheme is presented in the Table C. 

TABLE C 
Amino Arm s^m ^Assmr A-nnra 



Sub-classes 


Amino adds 


Acidic 


Aspartic acid. Glutamic acid 


Basic 


Noncyclic: Arginme, Lysme; CycHc: Histidme 


Charged 


Aspartic acid, Glutamic add, Arginme, Lysine, Histidine 


Small 


Glycine, Serine, Alanine, Threonine, Proline 


Polar/neutral 


Asparagine, Histidine, Glutamine, Cysteine, Serine, Threonine 


Polar/large 


Asparagine, Glutamine 


Hydrophobic 


Tyrosine, Valine, Isoleudne, Leudne, Meftdonine, 
Phenylalanine, Tryptophan 


Aromatic 


Tryptophan, Tyrosine, Phenylalanine 


Residues that influence 
chain orientation 


Glycine and Proline 



[0158] Accordingly, the present invention also contemplates variants of the 
naturally occurring or parent R polypeptide sequences or their biologically-active 
fiagments, wherein the variants are distinguished fiom the parent sequences by the 
addition, deletion, or substitution of one or more amino acids, lii general, variants 
display at least about 30. 40. 50. 55. 60, 65. 70. 75, 80. 85, 90, 91. 92, 93. 94, 95. 96, 97, 
98, 99 % similarity to a parent R polypeptide sequence as for example set forth in SBQ 
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ID NO: 2 or 4. Desirably, variants wiU have at least 30, 40, 50, 55, 60, 65, 70, 75, 80, 
85, 90, 91. 92, 93. 94. 95, 96, 97. 98, 99 % sequence identity to a parent R polypeptide 
sequence as set forth in SEQ ID NO:2 or 4. Moreover, sequences differing fiom the 
native or parent sequences by the addition, deletion, or substitution of 1, 2, 3, 4, 5, 6, 7, 
5 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19. 20, 30, 40. 50, 60 ,70, 80 ,90, 100, 150, 200, 
300. 500 or more amino adds but which retain the disease-iesistance-conforing or 
pathogen elicitor-Wnding properties are contanplated. Polypeptides of the invention 
include polypeptides that are encoded by polynucleotides that hybridise under 
stringency conditions as defined herein, especially high stringency conditions, to tiie 
10 polynucleotide sequences of the invention, or the non-coding strand fliereof, as 
described above. 

[01591 In one embodiment, variant polypeptides diflfer from the disclosed 
sequences by at least one but by less than 50, 40. 30. 20. 15, 10, 8, 6, 5, 4, 3, 2 or 1 
amino acid residue(s). In another, variant polypeptides differ fi-om the coiresponding 
15 sequence in SEQ ID NO: 2 or 4 by at least 1% but less ttian 20%, 15%, 10% or 5% of 
the residues. (If tiiis comparison requires alignment fbe sequences should be aligned for 
maxhnum similarity. '*Looped" out sequences fiom deletions or insertions, or mismatches, 
are considered differences.) The differences are, suitably, differences or changes at a 
non-essential residue or a conservative substitution. 

20 [0160] A **non-essential" amino acid residue is a residue that can be altered from 
the wild-type sequence of an R polypeptide without abolishing or substantially altering 
one or more of its activities (e.g., disease-resistance or patiiogen elicitor-binding 
properties). Suitably, the alteration does not substantially alter one of these activities, 
e.g.. the activity is at least 20%, 40%, 60%, 70% or 80% of wild-type. An "essential" 

25 amino acid residue is a residue that, when altered fiom the wild-type sequence of an R 
polypq)tide of the mvention, results in abolition of disease-resistance or pathogen 
eUcitor-binding properties sudi that less than 20% of flie wild-type activity is present. 
Fot racan^le. conserved amino add residues between tiie R polypq)tides shown in 
Figure 2 may be imamenable to alteration. 

30 (01611 Desirable variant R polypeptides are those having conserved amino acid 
substitutions. Examples of conservative substitutions include the following: aspartic- 
glutamic as acidic amino acids; lysine/arginine/histidine as basic amino acids; 
saine/fijycine/alanine^threonine as small amino acids; leucine/isoleucine. 
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methionine/valine, alanine/valine as hydrophobic amino acids. Conservative amino acid 
substitution also includes groupings based on side chains. For example, a group of 
amino acids having aliphatic side chains is glycine, alanine, valme, leucine, and 
isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is srane and 

5 threonine; a grooip of amino acids having amide-containing side chains is asparagme 
and glutamine; a group of amino acids having aromatic side chains is phenylalanme, 
tyrosine, and tryptophan; a group of amino adds having basic side chains is lysine, 
arginine, and histidine; and a group of amino acids havmg sulphur-containing side 
chains is cysteine and methionine. For example, it is reasonable to expect that 

10 replacement of a leucine with an isoleucine or valine, an aspartate with a glutamate, a 
threonine with a serine, or a similar replacement of an amino acid with a structurally 
related amino add will not have a major effect on tiie properties of the resultmg variant 
polypeptide. Whether an amino add change results in a functional R polypeptide can 
readily be detennined by assaying its disease resistance-conferring activity or its 

15 pathogen-elicitor-binding activity. Cons^ative substitutions are shown in Table D 
below under the heading of ^emplary substitutions. More preferred substitutions are 
shown undor the heading of preferred substitutions. Amino acid substitutions falling 
within the scope of flie invention, are, in general, accomplished by selecting 
substitutions that do not diffisr significantly in their effect on maintaming (a) the 

20 structure of the peptide bactoone m &e area of the substitution, (b) the charge or 
hydrophobidty of the molecule at the target site, or (c) the bulk of the side chain. After 
ttie substitutions are introduced, the variants are screened for biological activity. 



TABLED 

EXEMPLARV AND PREPERREO AMINO AClD S UBSTTTUTIONS 



Original Residue 


Exemplary Substitutions 


Preferred Substitutions 


Ala 


Val,Leu,ne 


Val 


Arg 


Lys, Gin, Asn 


Lys 


Asn 


Gfai,His,Lys, Arg 


Gbi 


Asp 


Glu 


Glu 


Cys 


Ser 


Ser 


Gin 


Asn, His, Lys, 


Asn 
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Bxen3plary Substitutions 


Preferred Substitutions 


Glu 


AsD. Lys 


A^ 


Glv 


Pro 


Pro 


His 


Asn» Gin, Lys, Arg 


Arg 




Ten VaL Met Ala. Phe. Norleu 


Leu 




Norleu. He. Val MetAla.Phe 


ne 


Lys 


Afff Glti Ash 

4ru|gy vjmy zupu 


Are 




T^ii Tie Phe 


Leu 




T i^ii Val Tl^ Ala 


Leu 


Pm 


Glv 


Gly 


OCT 


Thr 


Thr 


Thr 


Ser 


Ser 


Trp 


Tyr 


Tyr 


Tyr 


Tip, Phe, Thr, Ser 


Phe 


Val 


He, Leu, Met, Phe, Ala, Norleu 


Leu 



[0162] Alternatively, similar amino acids for making conservative substitutions can 
be grouped into three categories based on the identity of the side chains. The first group 
includes glutamic acid, aspartic acid, arginine, lysine, histidine, which all have charged 
5 side chains; the second group includes glycine, serine, threonine, cysteine, tyrosine, 
glutamine, asparagme; and the thhd group includes leucine, isoleucine, valine, alanine, 
proline, phenylalanine, tryptophan, methionine, as described in Zubay, G., 
Biochemistry, third edition, Wm.C. Brown Publishers (1993). 

101631 Thus, a predicted non-essential ammo acid residue in an R polypeptide is 
10 typically replaced with another amino acid residue from the same side chain family. 
Alternatively, mutations can be introduced randomly along all or part of an R gene 
coding sequence, such as by saturation mutagenesis, and die resultant mutants can be 
screwed for disease resistance-conferring activity or pathogen-elicitor-binding activity 
to idmtify mutants that retain activity. Following mutagenesis of the coding sequences, 
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the encoded peptide can be exi»essed lecombinantly and flie activity of flie pq>tide can 
be determined. 

[0164] In olher embodiments, the SCB includes an amino acid sequence having at 
least about 50%» 55%, 60%. 65%, 70%, 75%, 80%, 85%. 90%, 91%, 92%, 93%. 94% 
5 95%, 96%, 97%, 98% or more similarity to a corresponding sequence of SEQ ID NO: 2 
or 4, and has disease resistance-conferring activity or pafliogen-elicitor-binding activity. 

[0165] The R polypeptides of the present invention contain a significant number of 
stmctural charactaistics in common with each other as for example depicted in Figure 
2. The tram "femily" when referring to the protein and nucleic acid molecules of flie 
10 invention means two or more proteins or nucleic acid molecules having a common 
stmctural domain or motif and having suiBcient amino acid or nucleotide sequence 
homology as defined hwein. Sudi fiunily members can be naturally or non-naturally- 
occuning and can be fijom either^ same or different species. Members of a fenuly can 
also have common functional characteristics. 

15 5. Antt-Rpofypqptideantlgen'bindbtg molecules 

[0166] The invention also provides an anti^-binding molecule that is specifically 
immuno-interactive with an R polypeptide of flie invention, fii one embodiment, the 
antigen-binding molecule comprise whole polyclonal antibodies. Such antibodies may 
be prepared, fax example, by injecting a polypeptide, portion or variant of the invention 

20 into a production species, which may include mice or rabbits, to obtam polyclonal 
antiseta. Methods of producing polyclonal antibodies are well known to those skilled in 
the art. Exemplary protocols which may be used are desoibed for example in Cofigan et 
al., CURRENT PROTOCOLS IN IMMUNOLOGY, (John Wiley & Sons. Inc. 1991), 
and Ausnbel et al., (1994-1998, supra), in particular Swdoa m of Chapter 1 1. 

25 [0167] In lieu of polyclonal antisera obtained in a production species, monoclonal 
antibodies may be produced using the standard method as described, for exan^le. by 
KShler and Milstdn (1975, Nature 2S6, 495-497). or by more recent modifications 
thereof as described, for example, in CoUgan et aL, (1991, stg>ra) by immortalising 
spleen or ofiier antibody producing cells derived fiwm a production ^ecies which |ias 

30 been inoculated with one or more of the polypqitides, fiagments, variants or derivatives 
of flie invention. 
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[0168J The invention also contemplates as antigen-binding molecules Fv, Fab, Fab* 
and F(ab')2 immimoglobulin fragments. Alternatively, the antigen-binding molecule 
may comprise a synthetic stabilised Fv fragment. ExCTiplary fragments of this typo 
include single chain Fv fragments (sFv, frequently tenned scFv) in which a peptide 
5 linker is used to bridge the N tennmus or C terminus of a Vjy domain with tfie C 
terminus or N-tenninus, respectively, of a Vi domain. ScFv lack aU constant parts of 
whole antibodies and are not able to activate complement ScFvs may be piepared, for 
example, in accordance with methods outlined m Kreb^: et al (Kreber et al. 1997, J. 
Immunol Methods^ 201(1): 35-55). Alternatively, Uiey may be prepared by methods 

10 described in US. Patent No 5,091,513, European Patent No 239,400 or the articles by 
Winter and Milstein (1991, Nature 349:293) and Pltickthun et al (1996, In Antibody 
engineering: A practical approach, 203-252). In another embodiment, the synthetic 
stabilised Fv fragment comprises a disulphide stabilised Fv (dsFv) in which cysteine 
residues are introduced into the Vh and domains such that in the folly folded Fv 

15 molecule the two residues will form a disulphide bond tiierebetween. Suitable methods 
of producing dsFv are described for example in (Glockscuther et al Biochem. 29; 1363- 
1367; Reiter et al 1994, 7. Biol Chem, 269: 18327-18331; Reiter et al 1994, Biochem. 
33: 5451-5459; Reiter et al 1994. Cancer Res. 54: 2714-2718; Webber et al 1995, Mol 
Immunol 32: 249-258). 

20 [0169J Phage display and combinatorial methods for generating R polypeptide 
antigen-binding molecules are known in the art (as described m» e,g., Ladnor et al. U.S. 
Patent No. 5,223,409; Kang et al International Publication No. WO 92/18619; Dower et 
al. Intemational Publication No. WO 91/17271; Winter et al. International Publication 
WO 92/20791; Markland et al. Intemational Publication No. WO 92/15679; BreitUng et 

25 al. Intemational Publication WO 93/01288; McCafferty et al. International PubUcation 
No. WO 92/01047; Garrard et al. International Publication No. WO 92/09690; Ladner et 
al. Intemational Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 
9:1370-1372; Hay et al (1992) Hum Antibod Hybridomas 3:81-85; Huse et al. (1989) 
Science 246:1275-1281; Grififths et al. (1993) EMBO 7 12:725-734; Hawkins et al. 

30 (1992) J Mol Biol 226:889-896; Qackson et al. (1991) Nature 352:624-628; Gram et al. 
(1992) PNAS 89:3576-3580; Garrad et al. (1991) Bio/Technology 9:1373-1377; 
Hoogenboom et al. (1991) Nuc Acid Res 19:4133-4137; and Barbas et al. (1991) PNAS 
88:7978-7982). The antigen-binding molecules can be used to screen expression 
libraries for variant R polypeptides. They can also be used to detect and/or isolate the R 
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polypeptides of the invention. Thus, the invention also contemplates the use of antigen- 
binding molecules to isolate R polypeptide usuig , for example, any suitable 
immunoaffinity based mefliod including, but not limited to, immunochromatogr^hy 
and immunopreoipitation. A suitable mediod utilises solid phase adsorption in which 
5 anti-R polypeptide antigen-binding molecules are attadied to a suitable resin, the resin 
is contacted with a sample suspected of containing a natriuretic peptides, and ttie 
natriuretic peptides, if any, are subsequ«itly eluted &om flie resm. Blustrattve resins 
include: Sqjhatose® (Pharmacia), Poros® resins (Etoche Molecular Biochemicals, 
IndianapoKs), Actigel Superflow™ resins (Sterogene Biosepaiations Inc., Carlsbad 
10 Calif.), and Dynabeads™ (Dynal Inc., Lake Success, N.Y.). 

[01701 The antigen-binding molecule can be coupled to a compound, e.g., a label 
such as a radioactive nucleus, or imaging agent, e.g. a radioactive, enzymatic, or other, • 
e.g., imaging agent, e.g., a NMR contrast agrait. Labels which produce detectable 
radioactive emissions or fluorescoice are preferred. An anti-R polypeptide antigen- 

15 binding molecule (eg., monoclonal antibody) can be used to detect R polypeptides 
(e.g., m a cellular lysate or cell siqsematant) m order to evahiate the abundance and 
pattern of expression of the proton. Anti-R polypeptides antigen-binding molecules can 
be used diagnostically to monitor R polypqitides levels m tissue as part of an 
agronomic testing procedure. Detection can be fecilitated by coupling (i.e.. physically 

20 linkina> the antibody to a detectable substance (Le., antibody labellmg). Examples of 
detectable substances include various enzymes, joosthetic groups, fluorescait materials, 
luminescent materials, bioluminescwt materials, and radioactive materials. Examples of 
suitable enzymes mclude horseradish peroxidase, alkaline phosphatase, p-galactosidas^ 
or acetylcholinesterase; exanqiles of suitable prosthetic grot^ con^lexes toclude 

25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isotbiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerytiuin: an exanq)le of a 
luminescent material includes luminol; examples of bioluminescent matmals include 
luciferase, luciferin, and aequorin, and examples of suitable radioactive matraial include 

30 '^L '"L or 'H. The label may be selected from a group including a chromogen, a 
catalyst, an enzyme, a fluorophore, a chemiluminescent molecule, a lanthanide ion such 
as Europium (Eu^*), a radioisotope and a direct visual labeL In flie case of a direct visual 
label, use may be made of a colloidal metallic or non-metallic particle, a dye particle, an 
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enzyme or a substrate, an organic polymer, a laiex particle, a liposome, or other vesicle 
containing a signal producing substance and flie like. 

10171] A large number of enzymes useful as labels is disclosed in United States 
Patent Specifications U.S. 4,366,241, U.S. 4,843,000, and U.S. 4,849,338. Enzyme 
5 labels useful in the present mvention include alkaline phosphatase, horseradish 
peroxidase, luciferase, p-galactosidase, glucose oxidase, lysozyme, raalate 
dehydrogCTase and the like. The enzyme label may be used alone or in combination 
with a second enzyme in solution. 

6. Nucleic add constructs 
10 6.1 Prokarvotic expression 

[0172] The present invention furflier relates to a nucleic acid construct designed for 
genetic transformation of prokaryotic cells, comprising a polynucleotide, portion or 
variant according to the invention operably linked to a regulatory sequence, which will 
typically comprise a transcriptional control elraient or promoter. Suitably, the chimeric 

15 construct is operable in a Gram-negative prokaryotic cell. A variety of prokaryotic 
expression vectors, which may be used as a basis for constructing the chnneric nucleic 
acid construct, may be utilised to express a polynucleotide, portion or variant according 
to flie invention. These include but are not limited to a chromosomal vector (e.g., a 
bacteriophage such as bacteriophage X), an extrachromosomal vector (e.g., a plasmid or 

20 a cosmid expression vector). The expression vector will also typically contain an origin 
of replication, vMch allows autonomous replication of the vector, and one or more 
genes that allow phenotypic selection of the transformed cells. Any of a number of 
suitable promoter sequences, includmg constitutive and inducible promoter sequences, 
may be used in the expression vector (see e.g.. Bitter, et aL, 1987, Methods in 

25 Enzymology 153: 516-544). For €xanQ)le, inducible promoters such as pL of 
bactoiophage y, plac, ptip, ptac ptip-lac hybrid promoter and the like may be used. The 
nucleic acid construct may then be used to transform the desired prokaryotic host cell to 
produce a recombmant prokaryotic host cell, e.g., for producmg a recombmant R 
polypeptide. 

30 6.2 Bukarvotic expression 

[0173] The invention also contemplates a nucleic acid construct designed for 
expressing a polynucleotide, portion or variant of the invention in a eukaryotic host cell. 
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A variety of eukaiyotio host-expression vector systems may be utilised in this regard. 
These include, but are not limited to. yeast transformed with recombinant yeast 
expression vectors; insect ceU systems infected with recombinant virus expression 
vectors (e.g., baculovirus); or animal ceU systems infected with recombmant virus 
5 expression vectors (e.g., retroviruses, adenovirus. Vaccinia virus), or transfonned 
animal cell systems engineered for stable expression. In certam advantageous 
embodiments, the chimeric nucleic acid construct is designed for genetic transformation 
of plants as described hereinafter. 

6.3 Plant expression 

10 [01741 In accordance with the present invention, it is proposed that the R gene 
polynucleotides, portions and variants of the invention will be useful for facilitating the 
constraction of crop plants that are resistant to pathogenic disease, mcluding diseases 
caused by fungal pathogens, vmises, nematodes, insects and the like. Accordingly, the 
present mvention also relates to operably linking a polynucleotide, portion or variant of 

15 as described hereinto a regulatory sequence (e.g.. a promoter aad a 3* non-translated 
region) that is function m plants to create a nucleic acid construct, designed for genetic 
transfonnation of plants. 

Plant jnvmoters 

r0175| Numerous promoters that are active m plant cells have been described in the 
20 literature, Ulustrative examples of which molude the nopalme synthase (NOS) promoter, 
the octopine synthase (OCS) promoter (which is carried on tumour-inducing plasmidi 
of Agrobacterium tumeftciens), the cauhmoviras promoters such as the cauliflower 
mosaic virus (CaMV) 19S promoter and the CaMV 35S promoter, ttie figwort mosaic 
vhus 35S-promoter, flie li^t-indudble promoter fi»m the small subunit of ribulose-l,S- 
25 bis-phosphate carboxylase (ssRUBISCO). the Adh promoter, the sucrose synthase 
promoter, the R gene complex promoter, the GST-n-27 gene promoter and the 
chlonqthyll a/b binding protein gene promoter, etc. 

[017Q For the purpose of expression in source tissues of ttie plant, such as flie leaf, 
seed, root or stem, it is desirable that the promoters driving expression of the target gene 
30 have idativrfy high expression in fbsse specific tissues. For this purpose, one may 
choose from a number of promoters for genes with tissue- or ccU-specific or enhanced 
expression. Examples of such promoters rqwwted m the literature mclude the 
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cWoroplast glutamine synthetase GS2 promoter fiom pea. flie cWoioplast fiuctose.1,6. 
biphosphatase (FBPase) promoter ftom wheat, the nuclear photosynthetic ST-LSl 
promoter from potato, the serine^threomne kinase (PAL) promoter and the 
glucoamylase (CHS) promoter fiom Atabidopsis thaliana. Also r^rted to be active in 
5 pbotosyntheticaUy active tissues are the ribulose-LS-bisphosphate carbox3iase (RbcS) 
promoter from eastern larch (Larix laridna), the prxmioter for the cab gene, cab6. fiom 
pine, the promoter for the Cab-1 gene fiom wheat, the promoter for the CAB-l gene 
from spinach, the promoter for the cablR gene fiom rice, the pyruvate, orthopho^hate 
dikinase (PPDK) promoter fiom com, the promoter for the tobacco Lhcbl*2 gene, the 
10 Arabidopsis thaliana SUC2 sucrose-H+ symportnr and the promoter fi,r the thylakoid 
membrane proteins fiom spinach (psaD, psaF. psaE. PC, FNR, atpC, a^D, cab, ibcS). 
Other promoters for the chlorophyll a^-binding piotems may also be utilised in the 
invention, such as the promoters for LhcB gene and PsbP gene fiom white mustard. 
[01771 For the purpose of expression in sink tissues of the plant, such as the tuber 
15 of the potato plant, the fruit of tomato, or (he seed of com. wheat, rice and barley, it is 
desuable that the promoters driving cxpresrion of die target gene have relatively hi^ 
expression in these specific tissues. A number of promotets for genes with tuber- 
specific or tuber-enhanced expression are known, mcluding the class I patatin promoter, 
the promoter for the potato tuber ADPCTP g^es. both the large and smaU subunits. the 
20 sucrose synthase promoter, the promoter for the rn^or tuber proteins including the 22 
kd protein complexes and protease inhibitors, the promoter for the granule-bound starch 
synflrase gene (GBSS) and other class I and n patatins promoters. 

(01781 Other promoters can also be used to express a target gene in specific tissues, 
such as seeds or fiiiits. Examples of such promoters include the 5' regulatory regions 

25 fiom such genes as napin, phaseolm, soybean trypsm inhibitor, ACP. stearoyl-ACP 
desaturase, soybean «• subunit of p-conglycinin (soy 7s), and oleosm. Further examples 
include fte promoter for P-conglycinin. Also included are the zeins. which are a group 
of storage proteins found in com endosperm. Genomic clones for zein genes have been 
isolated and the promoters ftom these clones, includmg the 15 kD, 16 kD, 19 kD, 22 

30 kD, 27 kD and genes, could also be used. Othw promoters known to fimction,*fbr 
example, in com mclude the promoters fi>r the following genes: waxy. Brittle, Shrunken 
2, Branching enzymes I and n, starch synthases, debianchirrg enzymes, olcosms, 
glutelins and sucrose synthases. Examples of promoters suitable for expression m wheat 
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include those promoters for the ADPglucose pyrosynthase (ADPGPP) subunits, the 
granule bound and other starch synthase, the branching and debianching enzymes, the 
embiyogenesis-abundant proteins, the ghadins and the glutemns. Examples of such 
promoters in rice include those promoters for the ADPGPP subunits, the granule bound 
5 and other starch synthase, the branching enzymes, the debianching enzymes, sucrose 
synthases and the ghitelins. Examples of such promoters for barley include those for the 
ADPGPP subunits, the granule bound and other stareh synfliase, the branchmg 
enzymes, the debranching enzymes, sucrose synthases, the hoideins, the embryo 
globulins and the aleurone specific proteins. 

10 [01791 Root specific promoters may also be used. An example of such a promoter 
is the promoter for the acid chitinase gene. Expression m root tissue could also be 
accomplished usmg the root specific subdomams of the CaMV35S promoter that have 
been identified. 

[01801 Desirable promoteis for expression in cultured cells are strong constitutive 
•15 promotera, or promoters that respond to a specific inducer (Gatz and Lenk, 1998. 
Trends Phmt Science 3: 352-8). In certain embodunents, nucleic acid constmcts 
expressing R polynucleotides of the present mvention are introduced into banana plants 
that are susceptible Exemplary constitutive promoters for expression m intact banana 
plants are described in International PubUcation No. WO 02/053744 and in co-pending 
20 PCT AppKcation No. PCT/AU03/00919. 

6.3.2 3 ' Non-lrandtUed region 
[01811 The constmcts of the present invention can comprise a 3' non-translated 
sequence. A 3' non-translated sequence refers to that portion of a gene comprising a 
DNA segment that contains a polyadenylation signal and any otha- regulatory signals 
25 capable of effecting mRNA processing or gene expression. The polyadenylation signal 
is characterised by effecting the addition of polyadenylic acid tracts to the 3' end of the 
mRNA precursor. Polyadenylation signals are commonly recognised by identity with 
the canonical fonn 5' AATAAA-3* although variations are not uncommon. 

[01821 The 3' non-translated regulatory DNA sequence preferably includes from 
30 about 50 to 1,000 nucleotide base pairs and may contam plant transcriptional and 
ttanslational termination sequences in addition to a polyadenylation signal and any oflier 
regulatory signals capable of effecting mRNA processing or gene expression. Bxan^les 
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of suitable 3' non-translated sequences are the 3* transcribed non-translated regions 
containing a polyadraylation signal fiom the nopaline synthase {nos) gene of 
Agrobacterium tumefaciens (Bevan et al., 1983, NucL Add Res., 11:369) and the 
terminator for the T7 transcript from the octopine synthase gene of Agrobacteriwn 
S tumefaciens. Alternatively, suitable 3* non-translated sequences may be derived from 
plant genes such as the 3' end of the protease inhibitor I or n genes fiom potato or 
tomato, die soybean storage protein genes and the pea E9 ^all subunit of the ribulose- 
l,S-bisphosphate carbox}4ase (ssRUBISCO) gene, although oth^ 3' elements known to 
those of skill in the art can also be employed. Alternatively, 3' non-translated regulatory 
10 sequences can be obtained de novo as, for example, described by An (1987, Methods in 
Enzymology» 153:292). 

6.3.3 Optional sequences 
[0183] The nucleic acid construct of the present invention can furthw include 
enhancers, either translation or transcription enhancers, as may be required. These 

15 enhancer regions are well known to persons skilled in the art, and can include the ATG 
initiation codon and adjacent sequences. The initiation codon must be in phase with the 
reading frame of the coding sequence relating to the foreign or endogenous DNA 
sequence to ensure translation of the entire sequence. The translation control signals and 
initiation codons can be of a variety of origins, both natural and synthetic. Translational 

20 initiation regions may be provided fiom the source of the transcriptional initiation 
region, or from the foreign or endogenous DNA sequence. The sequence can also be 
derived fixim the source of the promoter selected to drive transcription, and can be 
specifically modified so as to increase translation of tiie mRNA. 

[0184] Examples of transcriptional enhancers include, but are not restricted to, 
25 elements from the CaMV 35S promoter and octopine synthase genes as for example 
described by Last et al (U.S. Patent No. 5,290,924, which is incorporated herein by 
reference). It is proposed that the use of an enhancer element such as the ocs element, 
and particularly multiple copies of the element, will act to increase the level of 
transcription fiom adjacent promoters when applied in the context of plant 
30 transformation* Alternatively, fiie omega sequence derived fix>m tfie coat protein gene of 
the tobacco mosaic virus (Gallie et a/., 1987) may be used to enhance translation of the 
mRNA transcribed fix>m a polynucleotide according to the invmtion. 
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[0185] As the DNA sequence inserted between the transcription initiation site and 
the start pf the coding sequence, i.e.» the untranslated leader sequence, can influmce 
gene expression, one can also employ a particular leader sequence. Preferred lead^ 
sequences include those tiiat comprise sequences selected to direct optimum expression 
5 of the foreign or endogenous DNA sequence. For example, such leader sequences 
include a prefixed consensus sequence which can increase or maintain mRNA stability 
and prevent inappropriate initiation of translation as for example described by Joshi 
(1987, NucL Acid Res.y 15:6643), which is incorporated herem by reference. However, 
other leader sequences, eg., the leader sequence of RTBV, have a high degree of 
10 secondary structure that is expected to decrease nlRNA stability and/or decrease 
translation of &e mRNA, Thus, leader sequences (i) that do not have a high degree of 
secondary structure, (ii) that have a big^ degree of secondary structure where the 
secondary structure does not inhibit mRNA stability and/of decrease translation, or (iii) 
that are derived from genes that are highly expressed in plants, will be most preferred. 

IS [0186] Regulatory elements such as the sucrose synthase intron as, for example, 
described by Vasil et al (1989, Plant Physiol^ 91:5175), the Adh intron I as, for 
example, described by Callis et al (1987, Genes Develop.^ II), or (he TMV omega 
element as, for example, described by Gallie et al. (1989, The Plant Cell, 1:301) can 
also be included where d^ed Other such regulatory elements useful in the practice of 

20 the invention are known to those of skill in the art 

[0187] Additionally, targeting sequences may be employed to target R polypeptide 
to an intracellular conQ>artment within plant cells or to fte extracellular environment 
For exan^le, a DNA sequence encoding a transit or signal peptide sequ^ce ntiay be 
operably linked to a sequence encoding the R polypeptide or biologically active portion 

25 ' thereof such tiiiat, when translated, the transit or signal peptide can transport the 
polypeptide or portion to a particular intracellular or extracellular destination, and can 
then be post-translationally removed. Transit or signal pq)tides act by facilitating the 
transport of proteins throu^ intracellular membranes, e.g., endoplasmic reticulum, 
vacm^, vesicle, plastid, nutochondrial and plasmalemma membranes. For example, tiie 

30 targeting sequence can direct a desired protein to a particular organelle such as a 
v^nele-or a plastid (eg., a chloroplast), ratiier tiian to the cytosol. Thus, the nucleic 
acid construct can fuitfa^ comprise a plastid transit peptide encoding DNA sequence 
operably linked between a promoter region or promoter variant according to the 
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invention and the foreign or endogenous DNA sequence. For example, refermce may 
be made to Heijne et al (1989, Eur. J. Biochem,, 180:535) and Keegstra et al. (1989, 
Ann. Rev. Plant Physiol Plant Mol BioL, 40:471), which are mcorporated herein by 
reference* 

5 [0188] The nucleic acid construct is typically introduced into a vector, such as a 
plasmid. Flasmid vectors include additional DNA sequences tiiat provide for easy 
selection, amplification, and transformation of the expression cassette in prokaryotic 
and eukaryotic cells, eg., pUC-derived vectors, pSK-derived vectors, pGEM-derived 
vectors, pSP-derived vectors, or pBS-derived vectors. Additional DNA sequences 
10 mclude origins of replication to provide for autonomous replication of the vector, 
selectable marker genes, preferably encoding antibiotic or heibicide resistance, unique 
multiple cloning sites providing for multiple sites to insert DNA sequences or genes 
encoded in the nucleic acid construct, and sequences that enhance transformation of 
prokaryotic and eukaryotic cells. 

15 [0189] TTie vector desirable contains an element(s) that permits either stable 
integration of the vector into the host cell genome or autonomous replication of the 
vector in the cell mdependent of the genome of the celL The vector may be integrated 
into the host cell gmome when introduced into a host cdl. For integration, the vector 
may rely on a foreign or endogenous DNA sequence present therein or any other 

20 element of the vector for stable integration of the vector into the genome by 
homologous recombination. Alternatively, the vector may contain additional nucleic 
acid sequences for directing integration by homologous recombination into the genome 
of the host cell. The additional nucleic acid sequences enable the vector to be integrated 
into the host cell genome at a precise location in the chromosome. To increase tiie 

25 likelihood of integration at a precise location, the mtegrational elemrats should 
preferably contain a sufficient number of nucleic acids, sudi as 100 to 1,500 base paiis, 
preferably 400 to 1,500 base paurs, and niost preferably 800 to 1,500 base pairs, which 
are hig^y homologous witii the corresponding target sequence to enhance the 
probability of homologous recombination. The integrational elements may be any 

30 sequence that is homologous with the target sequence in tiie genome of the host cell. 
Furthermore, the integrational elements may be non-encoding or encoding nucleic acid 
sequences. 
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[0190] For cloning and subcloning purposes, the vector may further comprise an 
origin of replication enabling the vector to replicate autonomously in a host cell such as 
abacterial cell. Examples of bact^al origins of replication are the origins of replication 
of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. 
5 colU and pUBl 10, pE194, pTA1060, and pAMjSl pennitting replication in Bacillus. The 
origin of replication may be one having a mutation to make its function temperature- 
sensitive in a Bacillus cell (see, e.g., Ehrlich, 1978, Proc. Natl Acad. ScL USA 
75:1433). 

6^3.4 Marker genes 

10 (01911 To facilitate identification of transformants, the nucleic acid construct 
desirably comprises a selectable or screenable maricer gene as, or in addition to, a 
polynucleotide sequence according to the invention. The actual choice of a marker is 
not crucial as long as it is functional (i,e,, selective) in combination with the plant cells 
of choice. The marker gene and the R polynucleotide sequence of interest do not have to 

15 be linked, since co-transformation of unlinked genes as, for example, described in U.S. 
Pat. No. 4»399,216 is also an efficient process in plant transformation. 

[0192] Included within the terms selectable or screenable marker genes are genes 
that encode a "secretable marker** whose secretion can be detected as a means of 
identifying or selecting for transfomied cells. Examples include markers that encode a 

20 secretable antigen that can be identified by antibody interaction, or secretable enzymes 
that can be detected by their catalytic activity, Secretable proteins include, but are not 
restricted to, proteins tiiat are inserted or trapped in the cell wall {e.g,^ proteins that 
include a leader sequence such as that found in the expression unit of extensin or 
tobacco PR-S); small, diffusible proteins detectable, e,g. by ELISA; and small active 

25 enzymes detectable in extracellular solution (e.^., a-amylase, ^-lactamase, 
phosphinothricin acetyltransferase). 

6.3.5 Sdectable markers 
[0193] Examples of bacterial selectable markers are the dal genes from Bacillus 
subtllis or Bacillus licheniformis, or markers that confer antibiotic resistance such as 
30 ampiciiiin, kanamycin, erythromycin, chloramphenicol or tetracycline resistance. 
Exemplary selectable markers for selection of plant transformants mclude, but are not 
Umited to, a hyg gene which micodes hygromycin B resistance; a neomydn 
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phosphotransferase (neo) gene conferring resistance to kanamycin, paromomycin, G418 
and the like as, for example, desmbed by Potrjtais et al. (1985, MoL Gen. GeneL 
199:183); a glutathione-S-transferase gene fiom rat liver conferring resistance to 
glutathione derived herbicides as, for example, desoibed in EP-A 256 223; a glutamine 
5 synthetase gene conferring, upon overexpression, resistance to glatamine synthetase 
inhibitors such as phosphinothricin as, for example, described WO87/05327, an acetyl 
transferase gene fix)m Sfreptomyces viridochromogenes conferring resistance to ttie 
selective agent phosphinothricin as, for example, described in EP-A 275 957, a gene 
encoding a 5-enolshikimate-3-phosphate synthase (EPSPS) conferring tolerance to N- 

10 phosphonomethylglycine as. for example, described by Hinchee et al (1988, Biotech.^ 
6:915), a bar gene conferring resistance against bialaphos as, for example, described in 
W09 1/02071; a nitrilase gene such as bxn fiom Klebsiella ozaenae which confers 
resistance to bromoxynil (Stalker et aL, 1988, Science, 242:419); a dihydrofolate 
reductase (DHFR) gene conferring resistance to methotrexate (Thillet et aL, 1988, J. 

15 BioL Chem., 263:12500); a mutant acetolactate synthase gene (ALS), which confers 
resistance to imidazolinone, sidfonylurea or other ALS-mhibiting chemicals (EP-A-154 
204); a mutoted anthranilate synthase gene that confers resistance to 5-methyl 
tryptophan; or a dalapon dehalogenase gene that confers resistance to the herbicide, 

6.3.6 Screenabie markers 

20 (0194] Preferred screenabie markers include, but are not limited to, a uidA gene 
encoding a ^-glucuronidase (GUS) enzyme for which various chiomogenic substrates 
are known; a P-galactosidase gene encoding an enzyme for which chromogenic 
substrates are known; an aequorin gene (Prasher et aL, 1985, Biochem. Biophys, Res. 
Comm., 126:1259), which may be employed in caldum-sensitive bioluminescence 

25 detection; a green fluorescent protein gene (Niedz et a/., 1995 Plant Cell Reports^ 
14:403); a luciferase Que) gene (Ow et al, 1986, Science, 234:856), which allows for 
bioluminescence detection; a ^-lactamase gene (SutcHffe, 1978. Proc. Natl Acad. ScL 
USA 75:3737), which encodes an enzyme for whidi various chromogenic substrates are 
known (e.g;, PADAC, a chromogenic cephalosporin); an R-locus gene, encoding a 

30 product that regulates the production of anfliocyanin pigments (red colour) in plant 
tissues (Dellaporta et al, 1988, in Chromosome Structure and Function, pp. 263-282); 
an a*amylase gene (Ikuta et al, 1990, Biotech., 8:241); a tyrosinase gene (Katz et al, 
1983, J. Gen, Microbiol, 129:2703) which encodes an razyme capable of oxidismg 
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• tyrosine to dopa and dopaquinone which in turn condenses to fonn the easily detectable 
compound melanm; or a jq^IE gene (Zukowsky et al., 1983, Proc. Natl Acad. ScL USA 
80:1101), which encodes a catechol dioxygenase that can convert chromogenic 
catechols* 

5 7. Introduction of Ute nucleic acid construct into plant cells 

[01951 The sequences of flie present invention can be used to transform or transfect 
any plant. In this manner, genetically modified plants, plant cells, plant tissue, seed, and 
the like can be obtained. Transfonnation protocols as well as protocols for introducing 
nucleotide sequences mto plants may vary depending on the type of plant or plant cell, 
10 i.e. monocot or dicot, targeted for transformation. It is recognised that the 
transfonnation protocols may be used for transfection or introduction of the 
oligonucleotide sequences to disnq^t R gene function. Suitable methods of introducing 
nucleotide sequences into plant cells and subsequent insertion into the plant genome 
include microinjection (Crossway et al, 1986, Biotechniques 4:320-334), 
15 electroporation (Riggs et al., 1986, Proc, Natl Acad ScL USA 83:5602-5606), 
Agrobacterium-mediated transformation (Townsend et al,, U.S. Pat No. 5,563,055; 
Zhao et al, U.S. Pat No. 5,981,840), direct gene transfer (Paszkowski et aL, 1984, 
EMBO /. 3:2717-2722), and ballistic particle acceleration (see, for example, Sanford et 
al., U.S. Pat No. 4,945,050; Tomes et al., U.S. Pat. No. 5,879,918; Tomes et al, U.S. 
20 Pat No. 5,886,244; Bidney et at, U.S. Pat No. 5,932,782; Tomes et al (1995) "Direct 
DNA Transfer into Intact Plant Cells via Microprojectile Bombardment," in Plant Cell, 
Tissue and Organ Culture: Fundamental Methods, ed. Gamborg and Phillips (Springer- 
Verlag, Berlin); and McCabe et aL, (1988, Biotechnology 6:923-926). Also see 
Weissinger et al (1988 Ann. Rev. Genet. 22:421-477), Sanford et al, (1987, Particulate 
25 Science and Technology S:27-37; onion). Christou et al, (1988, Plant Physiol 87:671- 
674; soybean); Datta et al, (1990, Biotechnology 8:736-740; rice), Klein et al (1988, 
Proa Natl Acad. ScL USA 85:4305-4309, maize), Hooykaas-Van Slogteren et al 
(1984, Nature (London) 311:763-764; cereals), Bowen et al, (U.S. Pat. No- 5,736,369; 
cereals), Bytebier et al., (1987, Proc. Natl Acad Sci. USA 84:5345-5349; liliaceae), 
30 De Wet er al (1 985, in The Experimental Manipulation of Ovule Tissues, ed, Ch^man 
et al (Longman, N.Y.), pp. 197-209; pollen), Kaqipler et al, (1990, Plant Cell Reports 
9:415-418; 1992, near. Appl Genet. 84:560-566; whisker-mediated transfonnation), 
D'Halluin et al. (1992, Plant Cell 4:1495-1505; electroporation); Li et al, (1993, Plant 
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Cell Reports 12:250-255; rice), Christou and Fonl (1995, Annals ofBota^ 75:407-413; 
rice) and Osjoda et al. (1996, Nature Biotechnology 14:745-750; maize via 
Agrobacteriiun tumefeciens). Guidance in the practical implementation of 
transformation systems for plant improvment is provided by Birch {1997, Anna, Rev. 
5 Plant Physiol Plant Molec, Biol 48: 297-326). 

(0196J In certain embodmients, the present invention is concerned with 
transfonning monocotyledonous plants, including graminaceous and non-graminaceous, 
monocotyledonous plants. Illustrative examples of non-graminaceous 
monocotyledonous plants inlcude, but aie not limited to, Musaceae (Musa and Ensete), 

10 taro, ginger, onions, garlic, pineapple, bromeliaeds, pahns, orchids, lilira, irises and die 
like. There are a variety of mettiods known currently for transformation of 
monocotyledonous plants. Presently, preferred mefliods for transformation of monocots 
are microprojectile bombardment of explants or suspension cells, and direct DNA 
uptake or electroporation as, for exaniple, described by Shimamoto et al (1989, supra). 

15 Transgenic maize plants have been obtained by introducing the Sireptomyces 
hygroscopicus bar gene into embryogenic cells of a maize suspension culture by 
microprojectile bombardment (Gordon-Kamm, 1990, Plant Cell, 2:603-618), The 
mtroduction of genetic material into aleurone protoplasts of other monocotyledonous 
crops such as wheat and barley has been reported (Lee, 1989, Plant Mol BioL 13:21- 

20 30), Wheat plants have been regenerated finom embryogenic suspension culture by 
selecting only the aged compact and nodular embryogMuc callus tissues for the 
establishment of the embryogenic suspension cultures (Vasil, 1990, Bio/Technol 8:429- 
434), The combination with transformadon systans for these crops enables the 
application of the present invention to monocots. These methods may also be q»plied 

25 for the transformation and regeneration of dicots. Transgenic sugarcane plants have 
been regenerated from embryogenic callus as, for example, described by Bower et al. 
(1996, Molecular Breeding 2:239-249), 

& Production and charaderisaiion of differ entUaed transgenic plants' 

8.1 Regeneration 

30 [0197] The methods used to regenerate transformed cells into differentiated plants 
are not critical to this invention, and any method suitable for a target plant can be 
enqjIoyeA Normally, a plant cell is regenerated to obtain a whole plant following a 
transformation process. 
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[0198] RegOTeration ftom protoplasts varies fiom species to species of plants, but 
generally a suspension of protoplasts is made first Jn certain species, embryo formation 
can then be induced fiom the protoplast suspension, to the stage of lipening and 
germination as natural CTibryos, The culture media will generally contain various amino 
5 acids and hormones, necessary for growth and regeneration. Examples of hormones 
utilised include auxins and cytoldnins. It is sometunes advantageous to add glutamic 
acid and proline to the medium, especially for such species as com and alfalfa, Efficimt 
regmeration will depend on the medium, on the graotype, and on the history of the 
culture. If these variables are controlled, regenemtion is reproducible. Regeneration also 

10 occurs from plant callus, explants, organs or parts. Transformation can be perfimned in 
the context of organ or plant part regeneration as, for raanqile, described in Methods in 
Enzymology, Vol. 118 and Klee et aL (1987, Annual Review of Plant Physiology, 
38:467), which are incoiporated herem by reference. Utilising the leaf disk- 
transfonnation-regeneration method of Horsch et at. (1985, Science, 227:1229, 

15 incorporated herein by reference), disks are cultured on selective media, followed by 
shoot foraiation in about 2-4 weeks. Shoots that develop are excised from calli and 
transplanted to appropriate root-inducing selective medium. Rooted plantlets are 
transplanted to soil as soon as possible after roots ^pear. The plantlets can be repotted 
as required, until reaching maturity. 

20 [01991 hi vegetatively propagated crops, the mature transgoiic plants are 
propagated by the taking of cuttings or by tissue cidture techniques to produce multiple 
identical plants. Selection of desirable transgenotes is made and new varieties are 
obtained and propagated vegetatively for conunercial use. 

[0200] In seed propagated crops, the mature transgenic plants can be self-crossed to 
25 produce a homozygous inbred plant. The inbred plant produces seed containing the 
newly introduced foreign gene(s). These seeds can be grown to produce plants that 
would produce the selected phenotype, e.g., early flowering. 

[02011 Parts obtained fiom the regenerated plant, such as flowers, seeds, leaves, 
branches. Suit, and the like are included in the invmtion, provided that these paits 
3D comprise cells fliat have been transformed as d^cribed. Progeny and variants, and 
mutants of the regenerated plants are also inchided within the scope of the invention, 
provided that these parts comprise the introduced nucleic acid sequences. 
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[0202] The literature describes numerous techniques for regenerating specific plant 
types and more are continually becoming known. Those of ordinary skill in the art can 
refer to the literature for details and select suitable techniques without undue 
experimentation. 

5 8.2 Characterisation 

[0203] To confinn &e pr^ence of a R polynucleotide of the invention m the 
regenerating plants> a variety of assays may be performed* Such assays include^ for 
exaiDple, •^molecular biological" assays well known to tfiose of skill in the ait, such as 
Southern and Northern blotting and PGR; an R protem expressed by the polynucleotide 

10 of ttie invention may be assayed using antig^-binding molecule as for «ample 
described herein. 

[02041 In order that tiie invention may be readily understood and put mto practical 
effect, particular pieferred embodiments will now be described by way of the foUowmg 
non-limiting example. 
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EXAMPLES 

EXAMPLE 1 

Identification of R genes from M acuminata 

CTAB total DNA pur^UaUon 
S [0206] Genomic DNA was ^tracted using the CTAB protocol of Stewart & Via 
(1993). Briefly, leaf tissue was frozen in liquid N2 and ground in a mortar and pestle. 
Powdered tissue was resuspended in CTAB Buffer (1% Sarcosine, 0.8 M NaCI, 0.022 
M EDTA pH8,0, 002 M Tris-HCl pH 7.8, 0.8% CTAB, 0.14 M Mannitol) at 65^C. An 
equal volume of chlorofoim:isoamyalcohol (24:1) was immediately added, mixed by 
10 inversion and incubated at 65^C for 10 min wiiOi occasional inversion. Samples were 
centrifiiged for 5 min at 14000 ipm in a micaofiige to separate phases. The aqueous 
layer was collected and an equal volume of isopropanol added* DNA was spooled out, 
washed in 70% ethanoU and allowed to dry before resuspending in 100 fiL dH20 
containing RNaseA (1 mg/mL). 

15 PuHficaHon of total UNA 

[0206] Total RNA extractions were performed using the method of Chang et al 
(1993), Tissue was fix)Z6n in liquid N2 and ground to a powder in a mortar and pestle. 
Powdered tissue was added to preheated (65** C) extracHon buffer (2% CTAB, 2% PVP, 
100 mM Tris HCl pH8, 25 mM EDTA, 2 M NaCl, 0.05% spermidine, 2% beta- 

20 mercaptoethanol). Chloroformrisoamyialcohol (24:1) was added, the su^ension 
vortexed, and samples centrifiiged at top speed in a microfuge for 5 min. The aqueous 
phase was collected and an equal volume of DEPC-treated 4M LiCI added. RNA was 
precipitated overnight 4^ C and tiien centrifoged at 4^ C for 30 min at top speed. 
Pelleted RNA was resuspended in 10 X SSTE and extracted once more with 

25 chlorofonn:isoamyialcohol (24:1). The RNA was reprecipitated at -20" C overnight 
following the addition of 1/10 volume DEPC-treated 2.5 M NaOAc pH6-0 and 21/2 
volumes of 100% ethanol Tubes were centrifiiged 20 min, the pellets washed with 
70% ethanol and resuspended in DEPC-treated dHsO. 

Reverse-transcriptase PCR of banana R-genes 

30 [0207] Sequences of R-genes &om plant species were aligned and deg^erate 

primers designed to conserved moti& in the NBS regions. The degenerate primers were 
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used to generate single-stranded cDNAs fiom total RNA using reverse transcriptase and 
then to subsequently ampUfy the NBS region of the banana R-genes, To generate the 
region 5' of the NBS domain, RNA primers were ligated to the 5' end of the mRNA 
after removal of the 5'^ap stiuctnre. Ligated mRNA was reverse transcribed using 

5 reverse transcriptase to generate single-stranded cDNA. Primer complementary to the 
Ugated RNA primer and a specific primer to the known NBS sequence was added and 
PCR undertaken to generate the 5' region of the R-gene using the parameters ofi initial 
denaturation step of 94° C for 2 min foUowed by 5 cycles of 94» C for 30 sees. 55-65- C 
for 30 sees. 72» C for 3-5 min, then 25 cycles of 94* C for 30 sees. 45-60' C for 30 

10 sees. 72«> C for 3-5 min, followed by a final annealing step 72» C for 10 min. N-terminal 
and C-temiinal primers were subsequently used to ampUfy complete R-gene sequences 
from genomic DNA using PCR with flic foflowing conditions: initial denaturation step 
of 94» C for 2 min, followed by 25 cycles of 94" C. for 30 sees. 55° C for 30 sees. 
72° C for 1-5 min, followed by a final annealing step 72' C for 10 min. All PCR 

15 products were cloned and sequenced to verify identity. Hie fuU-length nucleotide 
sequences for two R genes, one isolated from Musa acuminata (Calcutta 4) designated 
RGA5 and the other from Musa acuminata spp maiaccensis designated RGA2. are 
presented in SEQ ID NO: 1 and 3, respectively. 

[0208] RT-PCR was then used to compare the expression of the R genes between 
20 M acuminata spp maiaccensis plants that were susceptible or resistant to Fusarium 
oxysporum &p cubense (FOQ. The results presented in Figure 3 show fliat the RGA2 
gene (see lanes C2) is transcribed in FOC resistant plants but not in FOC sensitive 
plants. This suggests that RGA2 may be an attractive candidate for conferring disease 
resistance to susceptible plants. The inventors propose to transform Cavendish, which is 
25 resistant to race 1 but susceptible to race 4, (i) with RGA2 onl>r, (ii) with RGA5 and (iii) 
with both RGA2 and RGA5, under the control of a heterologous promoter (e.g.. Ubi) or 
the native RGA2 promotw. 
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The disclosure of every patent, patent ^plication, and publication cited herein is hereby 
incorporated herein by reference in its entirety, 

10209] The citation of any reference herein should not be construed as an admission 
that such reference is available as 'Trior Art" to the instant ^plication. 

5 10210] Those sldlled in fhe art will appreciate that the inv^tion described herein is 
susceptible to variations and modifications other tihan those specifically described. It is 
to be understood that the invention includes all such variations and modifications. The 
invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specificationf individually or collectively, and any and all 
10 combinations of any two or more of said steps or features. 

DATED this 25 S^tember, 2003 

QUEENSLAND UNIVERISTY OF TECHNOLOGY 

By their Patent Attorneys 

DAVmS COLUSON CAVE 
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Ctt aga cag tgt ttc get ttt tgc get gtg ttt cae aaa aat ta^ 
Leu ^ Oln cys Phe Ala Phe C?s Ila Si pSe nil Lys Isp ^ JeJ 
^-^^ 440 445 



490 495 



590 



1246 



1296 



1344 



1392 



ttc tat aaa cat gag ttg ate cag act tgg att gca gaa ggc ttc att 
Phe xyr I,ys His Glu Leu He Gin Tl^ t^J He Ila Ilu lly xll 

455 460 

s: SI s: IS 111 - -I - - - -u ^ >«. 

475 4ao 

Ilu ill v»i ^''^^ "'^ '^'^^^ '=39 tgg aga ggg 

His Glu Leu val Asn Arg Ser Phe Phe Gin Glu Ser Ar| Trp Arg lly 



1488 



cga tat gtg atg cat gae etc ata cac gat ctt occ caa t^^ at-^ i-^, 
Arg Tyr val Met Hie Aap Leu He His Lp Su fit Gin 5Se tit Sir 
500 SOS 510 

S*? I?^ '=9^ agg ata gat gat gac aag tec aaa gaa ace cct 
val Gly Glu Cys His Arg He Asp Asp Lp Lyf Ser LyJ IS Sr S« 
515 520 525 

IS ^ SI s s':; is ?s is s^s ^ 

535 540 

x®,*"? ^^'^ 9gt tac aat aaa ttg egg aco ctt atg ate aae aat 
val ASP Phe ser Gly Asn Lys Leu A^g ihr Leu Mel til tin Sn 

^50 555 

cag aga aat cag tat cca tat atg act aaa gtc aac age tqc tta tta 
Gin Arg Asn Gin Tyr Pro Tyr Met Thr Lys ?al Asn gs 

565 S70 575 

t^l c^*" °'=9 aaa aga ate cat gtt tta gtt ttg 1775 

Pro His Ser Leu Phe Lys Arg Leu Lys Arg He His val Leu Val 22 
580 535 



1536 



1584 



1632 



1680 



172B 



1824 



1872 



T ^ ^ 2?"" att ate ggt gac ttg ata 

Gin Lys cys Gly Met Lys Glu Leu Pro Asp He He lly Lp Leu He 

€00 605 

ri« f " 1*^ ""^^ ^^"^ get tgc att cag agg ttg 

Gin Leu Arg Tyr Leu Asp He Ser Tyr Asn Xla C?s He Gin S 

615 620 

ccc gag tea ttg tgc gac ctt tac aat ctg caa gca ctg agg eta taa issa 
Pro Glu ser Leu Cys Asp Leu Tyr Asn Leu Gin Ila Leu ^g LeS T% 

635 640 
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725 73Q 



735 



gag tta gag tgg gca get ggc cag gtt tec age ttg gag cat gag tta 
Qlu Leu Glu Trp AXa Ala Gly Gin Val Ser sir Lei 111 h" III Leu 
740 745 



750 



2u S^f ""^^ ttC etc 

Leu val ser Glu Glu Val Leu Leu Gly Leu Gin Pro His His Phe Leu 
'55 760 



755 



196B 



2016 



2064 



ggc tgt caa tta egg agt ttc cca caa ggc atg age aaa cto ate aao 
Gly Cys Gin Leu Arg Ser Phe Pro Gin Ily „e? HI lys Se 

550 655 

rlf, IfH f*"" ^ 3^'= *ta att tec aag ata tat gag 

Leu ftrg Gin Leu Arg Val Glu Asp Glu He He Ser LyI He Sr IS 
660 665 fi7o 

fJ"^ f*^ "^^^ ^^^^ ^"^^ Saa ttg tct gca ttc aaa gtg eta 

val Gly Lys Leu He Ser Leu Gin Glu Leu Ser La Phe Si 2u 

680 $85 

aat aat cat gga aac aaa ctt gca gaa eta agt ggt ttg aca caa etc 2112 

""^^ sir lly Le? ?hr oll 2u 

695 700 

ff*^ ^'^'^ ett gaa aat gta ggg agt aaa aaa 

Arg ser Thr Leu Arg He Thr Asn Leu Slu Asn ?al lly s£ t^l gS 

71S 720 

nT» f^^ ff*" aaa cag tat ctt gaa gca tta 

Glu Ala Ser Lya Ala Lys Leu His Arg Lys Gin Tyr Leu Glu Ila Leu 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



aaa agt ttg aca ate aga ggg tac agt ggt gca aca gta ccc agt toa 
LyB ser Leu Thr He Arg Gly lyr Ser Gly La Thr Val Pro s« ^ 

ctg gat gtg aaa atg eta ceg aac ttg gga act ctt aaa eta gag aac 
Leu ASP val Lys Met Leu Pro Asn Leu Gly Thr Leu Lye Leu §lu Asn 

''SO 795 800 

tgt aca aga ctg gag ggt ctt tea tat att gga caa ctg cca eat etc 
Cys Thr Arg Leu Glu Gly Leu Ser Tyr He Gly Gin Leu Pro h2s Leu 
805 810 815 

aag gtc ctt cat atg aag aga atg eet gtg gtg aaa eaa atg aat cat 2496 
Lys Val Leu His Met Lys Arg Met Pro Val Val Lys Gin Me? sS Ss 
820 825 830 

r^^ ^ ^^'^Sr ttc cct agg eta gaa gag 2544 

Glu Leu cys Gly Cys Thr Lys Ser Lys Leu Phe Pro A^J Leu llu llu 
835 840 845 

T« t!^ ^^^9 '^te ceg aat ctt gca 2592 

^ I-ys GXu Phe Pro Asn Leu Ila 
855 860 

«^ ^1 f*'*' atg ttt goa gta 2640 

Gte Leu Pro Cys Leu Lya He He His Met Lys Asn Met Phe Ala Val 
^ 870 875 

aaa cat ata ggt cgt gaa tta tat ggt gat ata gag age aat tgt ttt 2688 
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hya Hie lie Oly Arg Glu Leu Tyr Oly Asp lie Glu Ser Asn cys Phe 

890 895 

eta tea tta gaa gag ctt gtg ctg cag gac atg ctg aca tta oag gaa 2736 
Leu ser Leu Glu Glu I*u Val Leu Gin Asp Mel LeS Tbr Le? 0?2 llu 
30° 905 910 

etc cea aat ctt gga caa ctt cca cat ett aag gtt att oac ata aaa ^ipa 
Leu Pro Asn Leu Gly Glu Leu Pro His Leu Lyf Si Si Ss mt lyt 



925 



aac atg tct gca ctg aaa ctt ata ggt cgt gaa tta tgt qat tct aaa or-^o 
Asn Met ser Ala Leu Lye Leu He lly ^ liu Leu gs Ser S 
930 335 ^ » 



940 



Ilu Lv« rll II V"" """^ ^"^^ 9^9 =tg aag aac atg 

Glu Lys lie Trp Phe Pro Arg Leu Glu Val Leu Val Leu Lys Asn Met 

550 

tin l^t T«» n?^ ft* f""" ttg gac aac ttc cgt gtc tea aga 

Leu Ala Leu Glu Glu Leu Pro Ser Leu Asp Asn Phe Arg Val Ser aL 
5«5 970 975 

ph! o^f t^^ Staa gta ggc oat gga etc ttt agt get acg 

Phe Phe Ala Ser Ser Val Glu Val Gly His Gly Leu Phe sir Ila Th? 

985 990 

f*** ^V" ctg gaa gag eta gaa ate aag ggc atg 3024 

Arg Asn Lys Trp Phe Pro Arg Leu Glu oiu Leu Glu He LyI lly Met 
»S5 1000 1005 

^ ®f9 gaa etc cat tct ctt gaa aaa ctg cea tgt etc 3069 
loio 1015 



103O 



1080 



2880 



2928 



2976 



?, f fr^T Sga ttg cea gca gtg aaa aag ata ggc 3114 

^^"^ *^ val Lys LyI He If? 

^025 1030 1035 

His Ilv p^! P"" 0°'= "^^^^ ^9*^ ttt cea agg 3159 

?il„ ^ ^^"^ Arg Glu Cys Phe Pro A^g 

1045 lOSO 

ttg gaa gat etc gta tta age gac atg cca gca tgg gaa gag tga 3204 
Leu Qlu Asp Leu Val Leu Ser Asp Met Pro Xla llu II? S 

^°5S 1060 1065 

III fi" Hf^ *'9C ttg tgt aga ctt 3249 

1070 JJvs ^" ^ 



ttt ril f?. «f ^=90 ttg ctt ecc ate ect 3294 

^ fi!. ^'"^ I'ys eye Leu Leu Pro He Pro 

1085 1090 1095 

oat tct etc ata aaa ctt gaa tta tgg caa gtt ggg ctg aca gga 3339 
Has Ser Leu He Lys Leu Glu Leu Trp Gin Val 5ly Leu Thr Gly 

1105 1110 ^ 

Ctt cca gga tta tgc aaa gga att ggt gga ggt age age act aga 3384 
Leu Pro Gly Leu Cys Lys Gly He Gly Gly Gly Ser ser Thr aIq 
1115 1120 " 



1125 
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''-"^ 1140 

aga aat ctg gga gaa ggg ttg eta tea aac cac eta 
Arg Asn Leu Gly Glu Gly Leu Leu Ser Sn ill 

1150 1155 



cca cat ate 
Pro Hie He 



aat get att egg ata tgg gaa 
Asn Ala He Arg lie Trp Glu 
1160 1165 



tgt get gaa ctg ttg 
Cys Ala Glu Leu Leu 
1170 



gtc aag agg ttt aga gaa ttc acc acc ctt gag aac 
Val Lys Arg Phe Arg Glu Phe Thr Thr Leu GlS 

1"5 1180 1185 



tgg ctg cct 
Trp Leu Pro 



ttg tea ata 
Leu Ser He 



agg aac tgc ccc aag etc atg 
Arg Asn Cys Pro Lys Leu Met 
1190 1195 

gac etc etc etc ccg ecg tta 
Asp Leu Leu Leu Pro Pro Leu 
"OS 1210 



age atg aca cag tgt gag gag aat 
ser Met Thr Gin Cys Glu Glu Asn 
1200 



ate aag gca eta gaa 
He Lys Ala Leu Glu 
1215 

tgt gga aat ctt ggg aaa teg ctg cct acra toe r*f-a 
cys Gly Asn Leu Gly Lys Ser Leu P« lly Su 
"20 1225 1230 

e!^ V"^ !f ^^Sr ata tec aat tgt cca 

fo« "^"^ ''^^ Asn cS Pro 

1240 1245 



ttg ggt gac 
Leu Gly Asp 



cac aac etc 
His Asn Leu 



tac atg gta 
Tyr Met Val 



tec ctt cca agg gaa gta atg 
Ser Leu Pro Arg Qlu val Met 
1250 1255 



ctt cac ttg aag gaa 
Leu His Leu Lys Glu 
1260 



ctt gga act 
Leu Gly Thr 



?s £ jj: is ig 2? - - - - 

1270 1275 



caa gtt 
Oln Val 



etc aaa tea etc aag aga ttg gca ate ata gga tgt ccc 
1280 ^'^ ills Sso 

agg ctt ttg eta aat gaa ggg gat gag caa ggg gag gto tta tea 
ArgLeu i^uLeuAsnOluGly Asp Glu Gin If? ?a? Jeu S« 

1300 1305 



Ctg ctt gaa tta tea gta gat 
lieu Leu Qlu Leu Ser Val Asp 
1310 1315 



aaa aca gee eta ctt aaa etc tea 
Lys Tlir Ala Leu Leu Lys Leu Ser 
1320 



ctt ata aaa aat aca eta cca ttc ate cat tet etc am ai-r. s.i-r, 
I.eu Ile^ Lys Asn Thr Leu Pro^ Phe He Kit III w ^ Vll xll 



tgg tet cct cag aaa gtg atg 
Trp Ser Pro Gin Lys Val Met 
1340 1345 



ttt gac ttg gag gag 
Phe Asp Leu Glu Glu 
1350 



cag gaa ttg 
61n Glu Leu 



gtg cac age etc aca get etc agg cgc ctt gaa ttc 
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3429 



3474 



3519 



3564 



3609 



3654 



3699 



3744 



3789 



3834 



3879 



3924 



3969 



4014 



4059 



4104 
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Val His Ser Leu Thr Ala Leu Arg Arg l«u Olu Pbe Phe Arg Cys 
1355 1360 1365 

Lye Asn Leu Gin Ser Leu Pro Thr Glu Leu His Thr Leu Pro Ser 
"■'O 1375 1380 

etc cat get ttg gtt gta agt gac tgc cca cag ate caa tea ctg 4194 

V*^ ^^'^ Pro Gin lie Gin ser Leu 

"85 1390 1335 

ccg gag aag gga etc ccg aea etc etc aca gat tta gga ttt gac 4239 

1405 1410 

S^! ^° ^'^'^ ^^'^ 9^=3 ^'^a 9«a aag cac ctg goa 4284 

Kxs Cys Hx8 Pro Val Leu Thr Ala Gin Leu Glu Lys His Leu Ila 
"J^S 1420 . 142S 

1?^ t^f f*^ SSrt cga ttt cac cca gtt tat gca taggcaacat 4333 

Glu Met Lya Ser Ser Gly Arg Phe His Pro Val Tyr Ila 
"30 1435 1440 

gagtgaggat ggagaaaggg gagtggaaga gaaagatttc gattgce 4380 



<210> 2 

<211> 1441 

<212> PRT 

<213> Musa acuminata 



<400> 2 

Met ser Thr Ala Leu Val He Gly oly Trp Phe Ala Gin Ser Phe lie 
1 S 10 



is 



Gin Thr Leu Leu Asp Lya Ala Ser Asn Cys Ala He Gin Gin Leu Ala 
20 25 30 

Arg Arg Arg Oly Leu His Asp Asp Leu Arg Arg Leu Arg Thr Ser Leu 
35 40 45 

Leu Arg He His Ala He Leu Asp Lys Ala Glu Thr Arg Trp Asn His 
50 55 60 

Lys Asn Thr Ser Leu Val Glu Leu Val Arg Gin Leu Lys Asp Ala Ala 
65 70 75 

Tyr Asp Ala Olu Asp Leu Leu Glu Glu Leu Glu Tyr Gin Ala Ala Lys 
85 90 95 

Gin Lye val Glu His Arg Gly Asp Glu He Ser Asp Leu Phe Ser Phe 
100 105 110 



Leu ser thr Ala Ser Olu Trp Leu Gly Ala Asp Gly Asp Asp Ala 
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120 125 



61y Arg Leu Arg Glu He Gin Gly tya Leu Cys Asn lie Ala 

13S 



Ala 



Asp Met Met Asp Val Met Gin I^u Leu Ala Pro Asp Asp Gly Gly Ara 

155 ISO 

cm Pbe Asp Trp Lys Val Val Arg Arg Glu Thr Ser Ser Phe Leu Thr 



16S 170 



175 



Glu Thr Val val Phe Gly Arg Asp Gin Glu Arg Glu Lys Val Val Glu 
180 X85 190 



l*u Leu Leu Asp Ser Gly Ser Gly Asn Ser Ser Phe Ser Val Leu Pro 

200 205 

Gly Lys Tbr Thr Leu 
21S 220 



Leu val Gly He Gly Gly Val Gly Lys Thr Thr Leu Ala Gin Leu Val 



Asn ASP Asn Arg val Gly Asn Tyr Phe His Leu Lys Val Trp val 
230 235 240 



CyB val ser Asp Asn Phe Asn Val Lys Arg Leu xhr Lys Glu He 



245 250 



255 



Glu ser Ala Thr Lys Val Glu Gin Ser Asp Lys Leu Asn Leu Asp 
260 265 270 

Leu Gin Gin He Leu Lys Glu Lys He Ala Ser Glu Arg Phe Leu 



280 



He 



Thr 



Leu 



285 



val Leu ASP Asp Val Trp ser Glu Asn Arg Asp Asp Trp Glu Arg Leu 

295 3QQ 

eye Ala Pro Leu Arg Phe Ala Ala Arg Gly Ser Lys Val He Val Thr 

3" 31S 320 

Thr Arg Asp Thr Lys He Ala Ser He He Gly Thr Met Lys Glu He 



325 330 



335 



Ser Leu Asp Gly Leu Gin Asp Asp Ala Tyr Trp Glu Leu Phe Lys Lye 

345 350 

cys Ala Phe Gly Ser Val Asn Pro Gin Glu His Leu Glu Leu Glu Val 
355 360 365 
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lie Qly Arg tys He Ala Gly Lys Leu Lys Qly ser Pro ieu Ala Ala 

375 380 

hYB Thr Leu Gly Ser Leu Leu Arg Leu Asp Val Ser Gin Glu His Trp 

395 400 

Arg Thr He Met Glu Ser Glu Val Trp Gin Leu Pro Gin Ala Glu Aen 
405 410 

Glu He Leu Pro Val Leu Trp Leu Ser Tyr Gin His Leu Pro Gly His 
*20 425 

Leu Arg Gin Cya Phe Ala Phe Cys Ala Val Phe His Lys Asp Tyr Leu 

440 445 

Phe Tyr L^s His Glu Leu He Gin Thr Trp He Ala Glu Gly Phe He 

4S5 450 

Ma His Gin Gly Asn Lys Arg Met Glu Asp Val Gly Ser ser Tyr Phe 
" .475 480 

His Glu Leu Val Asn Arg Ser Phe Phe Gin Glu Ser Arg Trp Arg Gly 

490 495 

Arg Tyr Val Met His Asp Leu He His Asp Leu Ala Gin Phe He Ser 

505 510 

Val Gly Glu Cys His Arg He Asp Asp Asp Lys Ser Lys Glu Thr Pro 



520 



525 



Ser Thr Thr Arg His Leu Ser Val Ala Leu Thr Glu Gin Met Lys Leu 
530 535 54^ 

val Asp Phe Ser Gly Tyr Asn Lys Leu Arg Thr Leu Met He Aen Asn 

SSO 555 

Gin Arg Asn Gin Tyr Pro Tyr Met Thr Lys Val Asn Ser Cys Leu Leu 

570 575 

Pro His Ser Leu Phe Lys Arg Leu Lys Arg He His Val Leu Val Leu 
SBO 585 590 

Gin Lys cys Gly Met Lys Glu Leu Pro Asp He He Gly Asp Leu He 

600 605 

Gin Leu Arg Tyr Leu Aap He Ser Tyr Asn Ala Cys He Gin Arg Leu 

-9- 



P:\Opei\Vpa\VPA ProvisC<mIs\InisariuinprovisioaaUoc 

25 S^teinter2003 

^" 620 



Asn 
655 



Pro Glu Ser Leu Cys Leu Tyr Asn Leu Gin Ala Leu Arg Leu Trp 

63S 640 

Gly cys Gin Leu Arg Ser Phe Pro Gin Gly Met Ser Lya Leu lie 
«45 

Leu Arg 61r» Leu Arg Val Glu Asp Glu lie He Ser Lya lie Tyr Gli, 

fi65 670 

Val Gly Lys Leu He Ser Leu Gin Glu Leu Ser Ala Phe Lys Val Leu 
"5 680 685 

Asn Asn His Gly Asa Lys Leu Ala Glu Leu Ser Gly Leu Thr Gin Leu 

695 7Q0 

Arg ser Thr Leu Arg lie Thr Asn Leu Glu Asn Val Gly 8er Lys Glu 

^" 715 720 



Glu Ala Ser Lys Ala Lys Leu His Arg Lys Gin Tyr Leu Glu Ala 



725 



Leu 



730 735 
Glu Leu Glu Trp Ala Ala Gly Gin Val ser Ser Leu Glu His Glu Leu 



740 745 



750 



Leu val ser Glu Glu Val Leu Leu Gly Leu Gin Pro His His Phe Leu 

760 765 

Lys ser Leu Thr He Arg Gly lyr Ser Gly Ala Thr Val Pro Ser Trp 



780 



Leu Asp Val Lys Met Leu Pro Asn Leu Gly Thr Leu Lys Leu Glu Asn 

79S BOO 

Cya Thr Arg Leu Glu Gly Leu Ser Tyr He Gly Gin Leu Pro His Leu 
805 aio 815 

hya val Leu His Met Lys Arg Met Pro Val Val Lya Gin Met Ser His 
"0 825 830 

Glu Leu Q^s Gly Cys Thr Lys Ser Lys Leu Phe Pro Arg Leu Glu Glu 

840 845 

Lew Val Leu Glu Asp Met Pro Thr Leu Lys Glu Phe Pro Asn Leu Ala 
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Gin teu Pro Cys Leu Lye He He His Met Lys Asn Met Phe Ala Val 

870 87S 880 

Lys His He Gly Arg Glu Leu Tyr Gly Asp He Glu Ser Aan Cys Phe 
"85 890 895 

Leu ser Leu Glu Glu Leu Val Leu Gin Asp Met Leu Thr Leu Glu Glu 

905 

Leu Pro Asn Leu Gly Gin Leu Pro His Leu Lys Val He Hie Met Lys 

920 925 

Asn Met Ser Ala Leu Lys Leu He aiy Arg Glu Leu Cys Asp Ser Arg 

940 

Glu Lys He TTp Phe Pro Arg Leu Glu Val Leu Val Leu Lys Asn Met 

»50 955 ggQ 

Leu Ala Leu Glu Glu Leu Pro Ser Leu Asp Asn Phe Arg Val Ser Arg 
965 970 975 

Phe Phe Ala Ser Ser Val Glu Val Gly His Gly Leu Phe Ser Ala Thr 
®80 9B5 990 

Arg Asn Lys Trp Phe Pro Arg Leu Glu Glu Leu Glu He Lys Gly Met 
»S5 1000 1005 

Leu Thr^ Phe Glu Glu Leu His^ Ser Leu Glu Lys Leu Pro Cys Leu 

*la Val Lys Lys He Gly 
1030 1035 

^ ^^'^ <=y« *rg Glu Cys Phe Pro A«j 
1045 1050 

Leu Qlu Asp Leu Val Leu Ser Asp Met Pro Ala Trp Glu Glu Trp 
loss 1060 1065 

^^"^ 1070 ^ ^^"^ ?i7s *^ ^^"^ 

^ I-eu Leu Pro He Pro 

1090 1095 

His Ser Leu Ha Lys Leu Glu Leu Trp Gin Val Gly Leu Thr Gly 
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1110 



2SSeplend>er2003 



Leu pro Qly Leu Cys Lys Gly lie 61y Gly Gly Ser Ser Thr Arg 

1120 X12S 

Thr Ala Ser Leu Ser Leu Leu His lie He Lya cys Pro Asn Leu 
1130 1135 1140 

Arg Asa Leu Oly Qlu Gly Leu Leu Ser Asa His Leu Pro His He 
1145 1150 13^55 

*^ ^ ^» ®l'i 1*" Leu Trp Leu Pro 

1165 1170 

Val Lys Arg Phe Arg Glu Phe Tlir Thr Leu Olu Asn Leu Ser He 
1175 1180 118S 

^ W«t Gin Cys Glu Glu Asn 

^^^^ 1195 1200 

1205 \lxQ ^^'^ ^^i" Asp 

Cys Gly Asn Leu Gly Lys Ser Leu Pro Gly Cys Leu His Asn Leu 
1220 1225 1230 

^^"^ f«= ^^'^ Asn cys Pro Tyr Met Val 

1235 1240 1245 

^V^^ ^ ^^'^ "^'^ l-eii l-ys Qlu Leu Gly Thr 

1250 1255 1260 

Asp Oly Leu Oly Ser He Glu Gly Leu 
1265 1270 1275 

Gin Val Leu Lys Ser Leu Lys Arg Leu Ala He He Gly Cys Pro 
1280 1285 1290 

Arg Leu Leu Leu Asn Glu Gly Asp Glu Gin Gly Glu Val Leu Ser 
1295 1300 1305 

Leu Leu Glu Leu Ser Val Asp Lys Thr Ala Leu Leu Lys Leu Ser 
1310 1315 1320 

Leu He Lys Asn Thr Leu Pro Phe He His Ser Leu Arg He He 
1325 1330 1335 
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ItL ^° *«P Gl'^ Glu Leu 

" 1345 13S0 

""^^ JaL ^^"^ ""^^ '^^ ^ '^'^ Pl^e Phe Arg Cys 

1355 1360 jggg 

^^'^ ^^"^ ''^^ His Thr Leu Pro Ser 

1370 1375 

Leu His Ala Leu Val Val Ser Asp Cys Pro Gin He Gin Ser Leu 
"85 1390 1335 

^ ?Jnn "-^^ Asp Gly Phe ASP 

1405 1410 

His cys Hie Pro Val Leu Thr Ala Gin Leu Glu Lye Hia Leu Ala 

1420 1425 

^'^y ^9 Hia Pro val Tyr Ala 

"30 1435 i^^j, 

<210> 3 

<211> 3699 

<212> ONA 

<213> Musa acuminata spp malacoeneis 

<:220> 

<221> CDS 

<222> (1) , . (3696) 

<400> 3 



ulf f?- ^''^ 9=9 9=9 9tg ttc tec ctg gtg aat 48 

Met Ala Aap Val Thr Pro Gin Ala Ala Ala Val Phe Ser Leu Val ^1 

^ 10 15 

^^"^ *=*=9 ate 9tc gcg gaa etc egg ttg 96 

Glu lie Phe Asn Arg Ser He Asn Leu He Val III Glu Leu SI Jeu 

20 25 30 . 

fl?^ 9ce gag ctg aac aat ctg cag aga aca eta ttg 144 

Gin Leu Asn Ala Arg Ala Glu Leu Ann Aen Leu Gin A^g Thr Leu £eu 
35 40 45 

agg act cac tct ctg etc gag gag gca aag gcg agg egg atg act aac 199 
Arg Thr His Ser Leu Leu Glu Glu Ala LyI III aS a^ Mel Thr 

55 50 

aag tct etc gtg ctg tgg ctg atg gag etc aag gaa tgg gee tac oac Siin 
Lye ser Leu Val Leu Trp Leu Met Glu Leu LyI llS I^a JSp 
®^ '0 75 ^ 80 

gee gac gae ate etc gac gag tac gag gee gca gca ate cga etc aaa 
Ala ABp Asp He Leu Asp Glu Tyr Glu Ila Ila lla Xll Sg Hi 



268 
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90 95 

vlt S S pie Itl f^*^ f" ^'^^ "^'^ ^5 336 

Tor Arg ser Thr Phe Lys Arg Leu lie Aap His val lie lie Asn 

lOS 110 

v^J S5 2; IS 5S ?s s: n; k si ^ ;s 

115 120 12S 

x^^^T flf*^ =tg ggt gcg ctg gaa ggg tcci cao 433 

val ar Leu Glu Arg Olu Leu Asn Leu ^ LI LeS IS I?? Se? gS ' 

135 140 



1^ f= ^'^S tct ctt ctg act gaa 

Pro L6U Asp ser Thr Lys Arg Gly Val thr Thr Ser Leu Leu Thr US 

^50 155 



tct tgt att gtc ggg cga get caa gat aag gag aat ttg att caa tta 
ser Cys lie Val Qly Arg Ala Gin Asp Lyl 111 Asn SJu til Je? 



170 175 



Ctg ttg gag ccc age gat ggg gcg'gtt cct gtt att cct ata at-t- r,^^ 
Leu Leu Glu Pro Ser Asp Gly til ^al Pro vS vll Pro Til Ify 

185 ' 

tta gga ggg gca ggg aag acg act ctg tct caq ctt ate ttfc aai- 
Leu Gly Gly Ala Gly Lys Thr Thr LeS Ser Oil III III ^l 

200 205 

aag aga gtg gag gag cat ttc cca ttg aga atg tog gtg tot oto tct 
Lys Arg Val Glu Glu His Phe Pro Leu A?g Me! tS v2 sSr 



480 



528 



576 



624 



672 



220 



720 



gac gat ttt gat gtg aag aga att act aga gag ate aca gag tac occ 
Asp Asp Phe Asp val Lys Arg He Thr iJg 111 He ^ III HI 

230 235 240 

2?* ace aac ttg aat atg ctt caa att 7«8 

Thr Asn Gly Arg Phe Met Asp Leu Thr Asn Leu Asn Me? Le^ Sn vll 
245 250 255 

^ rl^ f** 3ag ata agg ggg acg aca ttt ttg ctt gtg ctg gat fiis 
Asa Leu Lys Glu Glu He Arg Gly Thr Thr Phe Leu Leu Val Leu ISp 

265 270 



gat gtg tgg aac gaa gac ccc gtg aag tgg gaa age etg tta oec cca pcii 
ASP val Trp Asn Glu Asp Pro Val Lyf S Ilu si? lS SS HI pS 

280 2BS 

tta gat gcc gga gga egg gga age gtg gtc att gtg acg aca cacr aac 012 
Leu Asp Ala Gly Gly Arg Gly Ser Val Val He ?a? Th? ?hr Gin sir 

295 300 

aaa aag gtc gcc gat gtc acc ggc acg ata aaa coa tac cr^^- 1-1- 
Lys Lys val Ala Asp Val Thr l!y Th? Jit III HI fj, ^ 

310 315 32^, 

gag tta acg gag gat gac agt tgg tea etc ate gag agt cac tec ttc 
Glu Leu Thr Glu Asp Asp ser Trp ser Leu He 5lu sir Ss Ser til 
325 330 335 
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agg gag gcg age tgc tct agt aca aat cct aga atg gaa gag ate ggg 1056 
Arg Glu Ala Ser Cys Ser Ser Thr Aan Pro Arg Met Glu Glu lie Gly 
340 345 350 

agg aag ata gcc aag aag ate agt ggc eta cct tac gga gea aca gca 1104 
Arg Lye lie Ala Lys Lys lie Ser Gly Leu Pro Tyr Gly Ala Thr Ala 

355 360 365 

atg ggg aga tat eta aga tct aag cac gga gaa age age tgg aga gaa 1152 
Met Gly Arg Tyr lieu Arg Ser I»ys His Gly Glu Ser Ser Trp Arg Glu 
370 375 380 

gtc ttg gaa act gag act tgg gag atg cca ecg get gca agt gat gtg 1200 
Val lieu Glu Thr Glu Thr Trp Glu Met Pro Pro Ala Ala Ser Asp Val 
385 390 395 400 

tta tec get eta agg aga agt tac gac aat eta ccc ect cag ctg aag 1248 
lieu Ser Ala Leu Arg Arg ser Tyr Asp Asn Leu Fro Pro Gin Leu Lys 
405 410 415 

etc tgt ttt gcc ttc tgt get ctg ttt aca aag ggc tac agg ttt cga 1296 
Leu Cys Phe Ala Phe Cys Ala Leu Phe Thr Lys Gly Tyr Arg Phe Arg 
420 425 430 

aag gat aca ctg ate cac atg tgg ata get caa aat ttg att caa tea 1344 
Lys Asp Thr Leu lie His Met Trp lie Ala Gin Asn Leu lie Gin Ser 

435 440 445 

aca gag teg aaa aga teg gag gac atg gca gaa gaa tgc ttt gat gat 1392 
Thr Glu Ser Lys Arg ser Glu Asp Met Ala Glu Glu Cys Phe Asp Asp 
4S0 455 460 

ttg gtg tgc aga ttc ttc ttt egg tac tec tgg ggc aac tat gtg atg 1440 
Leu Val Cys Arg Phe Phe Phe Arg Tyr Ser Trp Gly Asn Tyr Val Met 
465 470 475 480 

aat gac tea gtc cat gac etc get cga tgg gtt tea ttg gat gaa tat 1488 
Asn Asp Ser Val His Asp Leu Ala Arg Trp Val Ser Leu Asp Glu Tyr 
485 490 495 

ttt cga gca gat gaa gac tea cca ttg cat att tea aag cca att cgt 1536 
Phe Arg Ala Asp Glu Asp Ser Pro Leu His lie Ser Lys Pro He Arg 
500 505 510 

eat ttg tea tgg tgc agt gaa aga ata ace aat gtt ett gag gat aat 1584 
His Leu Ser Trp Cys Ser Glu Arg He Thr Asa Val Leu Glu Asp Asn 
515 520 525 

aac act ggt gga gat get gtc aat ccg etc age agt ttg cgc act etc 1632 
Asn Thr Gly Gly Asp Ala Val Asn Pro Leu Ser Ser Leu Arg Thr Leu 
530 535 540 

ett ttc tta ggc caa tct gag ttc egg teg tat cat ett ctt gat aga 1680 
Leu Phe Leu Gly Gin Ser Glu Phe Arg Ser Tyr His Leu Leu Asp Arg 
545 550 555 560 

atg ttc agg atg ttg age cga ate cgt gtt ttg gat ttc age aac tgc 1728 
Met Phe Arg Met Leu Ser Arg He Arg Val Leu Asp Phe Ser Asn Cys 
565 570 575 

gtc ata aga aat ttg cct tct teg gtt gga aat ctg aaa eat ctg cgt 1776 
Val lie Arg Asn Leu Pro Ser Ser Val Gly Asn Leu Lys His Leu Arg 
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580 585 590 

tac ctg ggc ctg tct aat acg aga abt caa agg ttg ccg gag tct gta 1824 
Tyr Leu Gly Leu Ser Asn Thr Arg lie Gin Arg Leu Pro Glu Ser Val 
595 600 605 

aca cgt ctt tgc etc cfct cag aca ttg eta eta gag ggc tgt gaa ctg 1B72 
Thr Arg Leu Cys Leu Leu Gin Thr Leu Leu Leu Glu Gly Cys Glu Leu 
6X0 615 620 

tgc agg tta cca aga age atg age agg etc gtc aaa etg agg eag etc 1920 
Cys Arg Leu Pro Arg Ser Met Ser Arg Leu Val Lys Leu Arg Qln Leu 
625 630 635 640 

aaa gca aat cca gat gta att gcc gac ata gcc aaa gtc ggg aga ttg 1968 
Lys Ala Asn Pro Asp Val He Ala Asp He Ala Lys Val Gly Arg Leu 
645 650 655 

ate gaa ctt eaa gag etg aaa gcc tat aat gtt gae aag aaa aaa gga 2016 
Xle Glu Leu Gin Glu Leu Lys Ala Tyr Asn Val Asp Lys Lys Lys Gly 
660 665 670 

cat ggg att gca gag eta agt gca atg aat cag ctt cac ggt gat ctt 2064 
His Gly Xle Ala Glu Leu Ser Ala Met Asn Gin Leu His Gly Asp Leu 
675 680 685 

tec act aga aac ctt caa aat gta gag aaa acg cga gag tct egg aag 2112 
Ser Xle Arg Asn Leu Gin Asn Val Glu Lys Thr Arg Glu Ser Arg Lys 
690 695 700 

geg agg ttg gac gag aaa cag aag ctt aag etc ttg gat ctg cga tgg 2160 
Ala Arg Leu Asp Glu Lys Gin Lys Leu Lys Leu Leu Asp Leu Arg Trp 
705 710 715 720 

get gae ggt agg ggt gcc gga gaa tgt gat cgt gac agg aaa gtt ctt 2208 
Ala Asp Gly Arg Oly Ala Gly Glu Cys Asp Arg Asp Aarg Lys Val Leu 
725 730 735 

aaa ggc etc cga cca cat cca aac ctg aga gaa ttg agt ate aaa tae 2256 
Lys Gly Leu Arg Pro His Pro Asn Leu Arg Glu Leu Ser He Lys Tyr 
740 745 750 

tae gga ggc act tea tct ccg agt tgg atg acg gat cag tat ctg ccc 2304 
Tyx Gly Gly Thr Ser ser Pro Ser Trp Met Thr Asp Gin Tyr Leu Pro 
755 760 765 

aac atg gaa acg att egc ctg cgt age tgc gca agg ttg acg gaa etc 2352 
Aan Net Glu Thr xle Arg Leu Arg Ser Cys Ala Arg Leu Thr Glu Leu 
770 775 780 

cca tgt etc ggt cag ctg cat ate ctt aga cat ttg cac ate gat ggg 2400 
Pro Cys Leu Gly Gin Leu His Xle Leu Arg His Leu His Xle Asp Gly 
785 790 795 800 

atg tee eaa gtg aga caa att aat ctg caa ttt tat ggc ace gga gaa 2448 
Met Ser Gin Val Arg Gin Xle Asn Leu Gin Phe Tyr Gly Thr Gly Glu 
805 810 815 

gtt tea ggt ttt eca ttg ctg gag etc ctg aac ata cgt cgc atg ccc 2496 
Val Ser Gly Phe Pro Leu Leu Glu Leu Leu Asn Xle Arg Arg Met Pro 
820 825 830 
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agt ctg gag gaa tgg teg gaa cca egg aga aac tgt tgc tac ttc cct 2544 
Ser lieu Olu Glu Trp Ser Glu Pro Arg Arg Asn Cys Cys Tyi: Phe Pro 
835 840 645 

cgc etc cat aaa ctg ctg ate gag gat tgt ccc agg etc agg aat ctg 2592 
Arg Leu Kis Lys Leu Leu He Glu Asp Cys Pro Arg Leu Arg Asn Leu 
850 855 860 

ccc tec etc cca cca aca ctg gaa gaa eta agg at a tea aga aca gga 2640 
Pro Ser Leu Pro Pro Thr Leu Glu Glu Leu Arg He Ser Arg Thr Gly 
865 870 875 880 

eta gtt gat ctt cca gga ttc cat gga aac ggt gat gtg acg acg aat 2688 
Leu Val Asp Leu Pro Gly Phe His Gly Asn Gly Asp Val Thr Thr Asn 
865 890 895 

gtt tec ctt tct tct ttg cat gtt teg gag tgt cga gaa ctg aga tec 2736 
Val Ser Leu Ser Ser Leu His Val Ser Glu Cys Arg Glu Leu Arg Ser 
900 905 910 

eta age gaa gga ttg ttg cag cac aac etc gtc gcc etc aag aca gcg 2784 
Leu Ser Glu Gly Leu Leu Gin His Asn Leu Val Ala Leu Lye Thr Ala 
915 920 925 

gca ttt ace gat tgt gat tct ctt gag ttt ttg ccg gcg gaa gga ttc 2832 
Ala Phe Thr Asp Cys Asp Ser Leu Glu Phe Leu Pro Ala Glu Gly Phe 
930 935 940 

aga aca gcc att tea ctt gaa tea ttg ata atg act aat tgt cca ctg 2880 
Arg Thr Ala He Ser Leu Glu Ser Leu He Met Thr Asn Cys Pro Leu 
945 950 955 960 

cct tgc agt ttt ctt ttg cct tee tct etc gag. cat eta aag ttg cag 2928 
Pro Cys Ser Phe Leu Leu Pro Ser Ser Leu Glu Kis Leu Lys Leu Gin 
965 970 975 

cca tgc etc tat cca aac aac aat gag gat tea ctg tea aca tgc ttc 2976 
Pro Cys Leu Tyr Pro Asn Asn Asn Glu Asp Ser Leu Ser Thr Cys Phe 
980 985 990 

gag aac etc aca tct ctt tec ttc ttg gae ate aaa gat tgt cca aat 3024 
Glu Asn Leu Thr Ser Leu Ser Phe Leu Asp He Lys Asp Cys Pro Asn 
995 1000 1005 

ctg tea tea ttt cca ccg ggt cct eta tgt cag eta tea gca etc 3069 
Leu Ser Ser Phe Pro Pro Gly Pro Leu Cys Gin Leu Ser Ala Leu 
1010 1015 1020 

caa cat ttg tee etc gtc aat tgc cag agg eta caa tct att ggc 3114 
Gin His Leu Ser Leu Val Asn Cys Gin Arg Leu Gin Ser He Gly 
1025 1030 1035 

ttc cag gca etc acc tec etc gaa age ttg aca att cag aac tgc 3159 
Phe Gin Ala Leu Thr Ser Leu Glu Ser Leu Thr He Gin Asn Cys 
1040 1045 1050 

cct cgc etc acc atg tea cac agt ttg gtt gag gtg aat aac tct 3204 
Pro Arg Leu Thr Met Ser His Ser Leu Val Glu Val Asn Asn Ser 
1055 1060 1065 

tec gat aca ggg etc gcg ttt aat ate act cga tgg atg cgc aga 3249 
Ser Asp Thr Gly Leu Ala Phe Asn He Thr Arg Trp Met Arg Arg 
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"■'O 1075 1080 

cga aca ggt gac gac ggc ttg atg etc aga cac cga gca caa aat 329« 
Arg TJr Gly Asp Asp Oly Leu «e£ Leu A?g His Sg HI g2 

."85 1090 jogg 

ttl t't*' ^^"^ ""^^ ''^^ etc acc ttc etc cag 3339 

^ ^'^^ =i« ^" Thr Phe Leu 

■m'" 1105 1110 

III rl^ f*^ «° '^^'^ ^to gta acc etc ace ggc 3384 

™ ^ ^^"^ «1» Val Thr Phe Thr lly 

1115 1120 1125 

f?^ Sf* ^3 ^^'^ act tct ctt caa att ctg 3439 

Glu Olu Olu Lys Trp Arg Aan Leu Thr Ser Leu Gin lie tZl 
1130 1135 1140 

?^!c ^ ^'^^ ^ I-eu Pro Ala Asn Leu 

1150 1155 

caa age etc tge tec etc age acc ttg tac ate gtc aga tgc coa as 19 

llfiO 1165 1170 

He Iif Ph*" ^3*^ S^*' ^»=9 tec etg gca 3564 

„ "-^^ ^''^ Gly val ser Met Sex Leu Ala 

1180 IIBS 

cat ttg gtc ate cat gaa tge cot cag etg tgt eag caa tot oat asoo 
llto JCs ^° S IS 



1200 



P^o 9at tgg eee tta ata get aat gta cca aga 3654 

^ Asp ASP Txp Pro Leu lie Ala Asn Val Pro 
1210 1215 

f"*" H?* act cat cca tgt cgc tgt ago acc acc tga 3699 
^« ™^ Pro Cys Arg Cys Ser Thr Thr 

"20 1225 1230 



<:210> 4 

t:211> 1232 

<212> PRT 

<213> Husa acuminata spp malaccensls 

<400> 4 



Met Ala Asp val Thr Pro Gin Ala Ala Ala Val Phe Ser Leu Val Asn 
^5 10 IS 

Glu He Pbe Asn Arg Ser He Asn Leu He Val Ala Glu Leu Arg Leu 
20 25 30 



Gin Leu Asn Ala Arg Ala Glu Leu Asn Asn Leu Gin Arg Thr Leu Leu 
3S 40 45 
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Arg Thr His Ser Leu l^eu Glu Glu Ala Lys Ala Arg Arg Met Thr Asp 
50 55 60 



Lys Ser Leu Val Leu Trp Leu Met Glu Leu Lys Glu Trp Ala Tyr Asp 
65 70 75 80 



Ala Asp Asp He Leu Asp Glu Tyr Glu Ala Ala Ala He Arg Leu Lys 
85 90 95 



Val Thr Arg Ser Thr Phe Lys Arg Leu He Asp His Val He He Asn 
100 105 110 



Val Pro Leu Ala His Lys Val Ala Asp He Arg Lys Arg Leu Asn Gly 
115 120 125 



Val Thr Leu Glu Arg Glu Leu Asn Leu Gly Ala Leu Glu Gly Ser Gin 
130 135 140 



Pro Leu Asp Ser Thr Lys Arg Gly Val Thr Thr Ser Leu Leu Thr Glu 
145 150 155 160 



Ser Cys He Val Gly Arg Ala Gin Asp Lys Glu Asn Leu He Arg Leu 
165 170 175 



Leu Leu Glu Pro Ser Asp Gly Ala Val Pro Val Val Pro He Val Gly 
180 185 190 



Leu Gly Gly Ala Gly Lys Thr Thr Leu Ser Gin Leu He Phe Asn Asp 
195 200 205 



Lys Arg Val Glu Glu His Phe Pro Leu Arg Met Trp Val Cys Val Ser 
210 215 220 



Asp Asp Phe Asp Val Lys Arg He Thr Arg Glu He Thr Glu Tyr Ala 
225 230 235 240 



Thr Asn Gly Arg Phe Met Asp Leu Thr Asn Leu Asn Met Leu Gin Val 
245 250 255 



Asn Leu Lys Glu Glu He Arg Gly Thr Thr Phe Leu Leu Val Leu Asp 
260 265 270 



Asp Val Trp Asn Glu Asp Pro Val Lys Trp Glu Ser Leu Leu Ala Pro 
275 280 285 



Leu Asp Ala Gly Gly Arg Gly Ser Val Val He Val Thr Thr Gin Ser 
290 295 300 
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Lys Lys Val Ala Aap Val Thr Gly Thr Met Glu Pro Tyr Val Leu Glu 

3" 315 320 

Qlu iKSu Thr Glu Asp Asp Ser Trp Ser Leu He Glu Ser His Ser Phe 
325 330 

Arg Glu Ala Ser Cys Ser Ser Thr Asn Pro Arg Met Glu Glu He Gly 

345 3S0 

Arg Lys lie Ala Lys Lys He Ser Gly Leu Pro Tyr Gly Ala Thr Ala 

360 365 

Met Gly Arg Tyr Leu Arg Ser Lys His Gly Glu Ser Ser Trp Arg Glu 

375 380 

val Leu Glu Thr Glu Ttor Trp Glu Met Pro Pro Ala Ala Ser Asp Val 

Leu Ser Ala Leu Arg Aig Ser Tyr Asp Asn Leu Pro Pro Gin Leu Lys 

410 ^ 

Leu Cys Phe Ala Phe Cya Ala Leu Phe Thr Lys Gly Tyr Arg Phe Arc 

425 430 

tys Asp Thr Leu He His Met Trp He Ala Gin Asn Leu He Gin Ser 



43S 440 



445 



Thr Glu ser Lys Arg Ser Glu Asp Met Ala Glu Glu Cys Phe Asp Asp 

455 45Q 

Leu val cys Arg Phe Phe Phe Arg Tyr Ser Trp Gly Asn Tyr Val Net 
*®® ^■'O 475 480 

Asn Asp Ser val His Asp Leu Ala Arg Trp Val Ser Leu Asp Glu Tyr 

490 495 

Phe Arg Ala Asp Glu Asp Ser Pro Leu His He Ser Lys Pro He Arp 
500 505 510 ^ 

His Leu Ser Trp cys Ser Glu Arg He Thr Asn val Leu Glu Asp Asn 
515 520 525 

Asn Thr Gly Gly Asp Ala Val Asn Pro Leu Ser Ser Leu Arg Thr Leu 
330 535 540 
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Leu Phe Leu Gly Gin Ser Glu Phe Arg Ser Tyr His I«u Leu Asp Arg 

555 

Met Phe Arg Met Leu Ser Arg He Arg Val Leu Asp Phe Ser Asn Cys 
565 S70 

val He Arg Asn Leu Pro Ser Ser Val Gly Aen Leu Lys His Leu Arg 

530 5g5 = 



590 



Tyr Leu Gly Leu Ser Asn Thr Arg He Gin Arg Leu Pro Glu ser Val 

600 

Thr Arg Leu Cys Leu Leu Gin Thr Leu Leu Leu Glu Gly Cys Glu Leu 

eye Arg Leu Pro Arg Ser Met Ser Arg Leu Val Lys Leu Arg Gin Leu 

"° «35 640 



Lys Ala Asn Pro Asp Val He Ala Asp He Ala Lys Val Gly Arg Leu 
®*5 650 655 

He Glu Leu Gin Glu Leu Lys Ala Tyr Asn Val Asp Lys Lys Lys Gly 



670 



His Gly He Ala Glu Leu Ser Ala Met Asn Gla Leu His Gly Asp Leu 

680 ^85 

ser He Arg Asn Leu Gin Asn Val Glu Lys Thr Arg Glu . Ser Arg Lys 

695 700 

Ala Arg Leu Asp Glu Lys Gin Lys Leu Lys Leu Leu Asp Leu Arg Trp 

715 720 



Ala Asp Gly Arg Gly Ala Gly Glu Cys Asp Arg Asp Arg Lys Val Leu 
725 730 

Lys Gly Leu Arg Pro His Pro Asn Leu Arg Glu Leu Ser He Lys Tyr 



750 



Tyr Gly Gly Thr Ser Ser Pro Ser Trp Met Thr Asp Gin Tyr Leu Pro 



760 



765 



Asn Met Glu Thr He Arg Leu Arg Ser Cys Ala Arg Leu Thr Glu Leu 

775 780 

Pro cys Leu Gly Gin Leu His He Leu Arg His Leu His He Asp Gly 

-795 800 
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Met Ser Gla Val Arg Gin lie Asn Leu Gin Phe lyr Gly Thr Gly Glu 
^'^^ 810 815 

val ser Gly Phe Pro Leu Leu Glu Leu Leu Asn lie Arg Arg Met Pro 

825 830 

ser Leu Glu Glu Trp Ser Glu Pro Arg Arg Asn Cys Cys Tyr Phe Pro 
835 840 845 

Arg Leu His Lys Leu Leu lie Glu Asp Cys Pro Arg Leu Arg Asn Leu 
" 855 860 

Pro Ser Leu Pro Pro Thr Leu Glu Glu Leu Arg He Ser Axg Thr Gly 

875 880 

Leu val Asp Leu Pro Gly Phe His Gly Asn Gly Asp Val Thr Thr Asn 

890 895 

val ser Leu Ser Ser Leu His Val Ser Glu Cys Arg Glu Leu Arg ser 

905 

Leu ser Glu Gly Leu Leu Gin His Asn Leu Val Ala Leu Lys Thr Ala 

920 925 

Ala Phe Thr Asp Cys Asp Ser Leu Glu Phe Leu Pro Ala Glu Gly Phe 

935 940 

Arg Thr Ala He Ser Leu Glu Ser Leu He Met Thr Asn Cys Pro Leu 

955 960 

Pro Cys Ser Phe Leu Leu Pro Ser Ser Leu Glu His Leu Lys Leu Gin 

970 



Pro cys Leu Tyr Pro Asn Asn Asn Glu Asp Ser Leu Ser Thr cys Phe 

985 990 

Glu ABU Leu Thr Ser Leu Ser Phe Leu Asp He Lys Asp Cys Pro Asn 

1000 X005 

'^'^ Itlc ^^'^ Ills io20 

^25 ^030 ^ ^'"^ 



1035 
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^""^ ?Jjo «^ Asn Cys 



1050 



''^ Jgs JJ;^^ Ser Leu Val Olu Val Asn Asa Ser 



1065 



ser Asp^ Thx Oly Keu Ala Phe^ Asn He «xr Arg Met Arg Arg 

Arg Thr^ aiy Asp Asp Gly Leu^ Met Leu Arg His Arg^ Ala Gin Asn 

t^o ^l*^ «i« I-eu Thr Phe Leu Gin 

iios 1110 

Phe Leu^ Lys He Cys Gin Cys^ Pro Gin Leu Val Thr Phe Thr Oly 



1125 



"^^^ JSd ^" ^" ser Leu Gin He Leu 

1135 1140 

His lie val Asp cys Pro Asn Leu Glu Val Leu Pro Ala Asn Leu 

"SO 1155 

hJo Arg cys Pro 

1165 1170 

^« P~ Gly Gly Val Ser Met Ser Leu Ala 

1180 1185 



His Leu^ val He His Glu C^s Pro Gin Leu Cys Gin Arg Cys Asp 

"55 1200 

So ^* *^ 



1215 



^0 ^^"^ *^ J225 *^ ^^"^ ^ 



1230 
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